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Abstract 

The Generalized Second Price (GSP) auction is the primary auction used for monetizing the use of 
the Internet. It is well-known that truthtelling is not a dominant strategy in this auction and that inefficient 
equilibria can arise. Edelman et al. and Varian show that an efficient equilibrium always exists in the full 
information setting. Their results, however, do not extend to the case with uncertainty, where efficient 
equilibria might not exist. 

In this paper we study the space of equilibria in GSP, and quantify the efficiency loss that can arise 
in equilibria under a wide range of sources of uncertainty, as well as in the full information setting. 
The traditional Bayesian game models uncertainty in the valuations (types) of the participants. The 
Generalized Second Price (GSP) auction gives rise to a further form of uncertainty: the selection of 
quality factors resulting in uncertainty about the behavior of the underlying ad allocation algorithm. The 
bounds we obtain apply to both forms of uncertainty, and are robust in the sense that they apply under 
various perturbations of the solution concept, extending to models with information asymmetries and 
bounded rationality in the form of learning strategies. 

We present a constant bound (2.927) on the factor of the efficiency loss {price of anarchy) of the 
corresponding game for the Bayesian model of partial information about other participants and about ad 
quality factors. For the full information setting, we prove a surprisingly low upper bound of 1.282 on 
the price of anarchy over pure Nash equilibria, nearly matching a lower bound of 1.259 for the case of 
three advertisers. Further, we do not require that the system reaches equilibrium, and give similarly low 
bounds also on the quality degradation for any no-regret learning outcome. Our conclusion is that the 
number of advertisers in the auction has almost no impact on the price of anarchy, and that the efficiency 
of GSP is very robust with respect to the belief and rationality assumptions imposed on the participants. 
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1 Introduction 



The sale of advertising space on the Internet, or AdAuctions, is the primary source of revenue for many 
providers of online services and corresponds. According to a recent report [10], $25.8 billion dollars were 
spent in online advertisement in the US in 2010. The main part of this revenue comes from search advertise- 
ment, in which search engines display ads alongside organic search results. The success of this approach is 
due, in part, to the fact that providers can tailor advertisements to the intentions of individual users, which 
can be inferred from their search behavior. A search engine, for example, can choose to display ads that 
synergize well with a query being searched. However, such dynamic provision of content complicates the 
process of selling ad space to potential advertisers. Each search query generates a new set of advertising 
space to be sold, each with its own properties determining the applicability of different advertisements, and 
these ads must be placed near-instantaneously. 

The now-standard mechanism for resolving online search advertisement requires that each advertiser 
places a bid that represents the maximum she would be willing to pay if a user clicked her ad. These bids 
are then resolved in an automated auction whenever ads are to be displayed. By far the most popular bid- 
resolution method currently in use is the Generalized Second Price (GSP) auction, a generalization of the 
well-known Vickrey auction. In the GSP auction, there are multiple ad "slots" of varying appeal (e.g. slots 
at the top of the page are more effective). In two seminal papers Edelman et al. (H and Varian ||25| propose 
a simple model of the GSP auction that we will also adopt in this paper. They observe that truthtelling is 
not a dominant strategy under GSP, and GSP auctions do not generally guarantee the most efficient outcome 
(i.e., the outcome that maximizes social welfare). Nevertheless, the use of GSP auctions has been extremely 
successful in practice. This begs the question: are there theoretical properties of the Generalized Second 
Price auction that would explain its prevalence? Edelman et al. [8] and Varian ll25l provide a partial answer 
to this question by showing that, in the full information setting, a GSP auction always has a Nash equilibrium 
that has same allocation and payments as the VCG mechanism - and therefore is efficient. 

While these results provide important insight into the structure of the GSP auction, we will argue that 
the Generalized Second Price auction is best modeled as a Bayesian game of partial information. Modeling 
GSP as a full information game assumes that each auction is played repeatedly with the same group of 
advertisers, and during such repeated play the bids stabilize. The resulting stable set of bids is well modeled 
by a full information Nash equilibrium. However, the set and types of players can vary significantly between 
rounds of a GSP auction. Each query is unique, in the sense that it is defined not only by the set of keywords 
invoked but also by the time the query was performed, the location and history of the user, and many other 
factors. Search engines use complex machine learning algorithms to select the ads, and more importantly to 
determine appropriate quality scores (or factors) for each advertiser for a particular query, and then decide 
which advertiser to display. This results in uncertainty both about the competing advertisers, and about 
quality factors. We model this uncertainty by viewing the GSP auction as a Bayesian game, and ask: what 
are the theoretical properties of the Generalized Second Price auction taking into account the uncertainty 
that the advertisers face? 

Our answer to this question is based on a quantitative understanding of the inefficiencies that can arise 
in GSP auctions, using a metric known as the Price of Anarchy. We show that the welfare generated by the 
auction in any equilibrium of bidding behavior is at least a ^ -fraction of the maximum achievable welfare 
(i.e., the welfare the auction could generate knowing the player types and quality factors in advance). The 
value of 7] measures the robustness of an auction with respect to strategic behavior: in the worst case, how 
much can strategic manipulation harm the social welfare. The closer rj is to 1, the more robust the auction 
is. An auction that always generates efficient outcomes at equilibrium would have price of anarchy equal 
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to 1. 

For mechanisms that are not dominant strategy truthful, like GSP auctions, price of anarchy analysis 
is a powerful tool for quantifying the potential loss of efficiency at equilibrium. We conduct this analysis 
both in a full information setting without uncertainty (in which the price of anarchy is surprisingly small, 
indicating a loss of at most 22% of the welfare), but also in a setting with uncertainty and a very general 
information structure, in which we prove that the price of anarchy is still bounded by a small constant. One 
should note that, remarkably, the welfare loss of these auctions is bounded by a value that does not depend 
on the number of players, the number of advertisements for sale, or the prior distributions on player types. 

One feature of our results is that they hold for a variety of models regarding the rationality and the beliefs 
of the players. This robustness is particularly important in large-scale auctions conducted over the Internet, 
where assumptions of full information and/or perfect rationality of the participants are unreasonably strong. 

The GSP auction and sources of uncertainty. By far the most popular auction method currently in use 
for search ads is the Generalized Second Price (GSP) auction, a generalization of the well-known Vickrey 
auction. The GSP auction is invoked every time a user queries a keyword of interest; it is a repeated auction 
in which players repeatedly bid for ad slots. However, modeling equilibrium strategies in a repeated game 
of this nature is notoriously difficult, and results in a game with a plethora of unnatural equilibria due to 
the possibility of bids representing threats for future rounds, optimal exploration of the bidding space, and 
so on. A common simplification used in the literature is to focus on auctions for a single keyword, and to 
suppose that players will quickly learn each others' valuations and reach a stationary equilibrium. Under 
this assumption, the stationary equilibrium would correspond naturally to a Nash equilibrium in the full 
information, one-shot version of the GSP auction J9J. It has therefore become common practice to study 
pure, full information equilibria of the one-shot game, as an approximation to expected behavior in the more 
general repeated game Il8l l25ll22l . 

In reality, however, the set and types of players can vary significantly between rounds of a GSP auction: 
each query is unique, in the sense that it is defined not only by the set of keywords invoked but also the 
time the query was performed, the location and history of the user, and many other factors. This context is 
taken into account by an underlying ad allocation algorithm, which is controlled by the search engine. The 
ad allocation algorithm not only selects which advertisers will participate in an auction instance, but also 
assigns a quality factor to each advertiser. As a first approximation we can think of the quality factor as 
a score that measures how likely that participant's ad will be clicked for that query. These quality factors 
are then used to scale the bids of the advertisers. These scaled bids are known as effective bids, which 
can be viewed as bids derived from a similarly-modified effective type. Under our assumption that quality 
factors measure clickability, the effective type of an advertiser is the expected valuation of displaying the ad 
(valuation of the ad times its likelihood of getting a click). The effective bid and effective type of a player are 
therefore random variables, which can be thought of as the original valuations multiplied by quality scores 
computed exogenously by the search engine. Athey and Nekipelov HI point out that the uncertainty in 
quality factors produces qualitative changes in the structure of the game. Thus, even if players converge to a 
stationary bidding pattern, the resulting equilibrium cannot be described as the outcome of a full information 
game. 

We model the uncertainty about the effective types of advertisers as a Bayesian, partial information 
game. That is, the inherent uncertainty due to context and the ad allocation algorithm can be captured via 
prior distributions over effective types, even when the true types of all potential competitors are fully known. 
The appropriate equilibrium notion is then the Bayes-Nash equilibrium with respect to these distributions. 
Our model allows arbitrary correlations between the types and quality factors. The uncertainty of ad quality 
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and allocation mostly comes from the query context, and hence is best modeled by correlated distributions 
of types and ad quality. Search engines use complex machine learning algorithms to compute quality factors 
based on all available information about the context, whose outcome is hard to predict for the advertisers. 
We assume that the advertisers are aware of the distribution of quality factors, and that the quality factors 
computed by the search engine correspond to the clickability of the ad. Our results deteriorate gracefully if 
the outcome of the machine learning algorithm is not exact, but rather gives only an approximation to the 
clickability of an ad. 

Summarizing, there are two main sources of uncertainty: the first is about the quality factors that the 
search engine attributes to each advertiser and the second is about the valuations (types) of the players. 
These sources are different in nature: each advertiser has knowledge of (and can condition her behavior on) 
her own type, whereas quality factors are fully exogenous and are only revealed ex post. 

Asymmetric information. There are different types of players in advertising markets, which may have 
differing levels of information about their competitors. We assume all players know their own valuations 
correctly, but some smaller players (such as individual advertisers) might be clueless about the valuations of 
the other players and expected behavior of quality scores, while others (say bidding agencies or large com- 
panies with web advertising departments) may have a much better understanding of how individual rounds 
of the auction will proceed. Even among this latter group, different advertisers may have access to differ- 
ent information. We can model such information asymmetries by giving each player access to an arbitrary 
player-specific signal that can carry information about the effective types of the auction participants. Our 
bounds on social efficiency in the Bayesian model hold in settings with such asymmetry in information. 

Learning players. So far we have considered equilibria of the auction game. Instead of assuming that 
players who have played long enough will be in equilibrium, one can model the entire learning process 
more explicitly. One natural model is that players employ strategies that give them vanishingly small regret 
over time. Roughly speaking, such a model assumes that players observe the bidding patterns of others 
and modify their own bids in such a way that their long-term performance is at least as good as a single 
optimal strategy chosen in hindsight. This will cause the empirical distribution of the bids to converge to 
a coarse correlated equilibrium of the game. Many simple bidding strategies yield low regret, such as Hart 
and Mas-Collel's regret matching strategy |[T3l or the multiplicative weight updating strategy of |[T6l (see 
also |2|). These strategies are not necessarily in equilibrium, but capture the intuition that players attempt 
to learn beneficial bidding strategies over time. 

The players can use these standard learning algorithms to learn how to best bid given their valuation and 
signal. In other words, for each possible valuation and signal, repeated auctions allow players to learn how 
to best bid taking into account the varying bids of other players, and the uncertainty about quality factors, 
other players' valuations, and bidding strategies. We will consider the quality degradation of the average 
social outcome when all players employ strategies with small regret. Blum et al. El introduced the term 
Price of Total Anarchy for this analog of the price of anarchy. 

Approximate rationality. One of the fundamental assumptions in auction analysis is that all players are 
perfectly rational utility optimizers. However, in reality (and especially in large online settings), it is natural 
to assume that some fraction of the players participating in an advertising auction might have unsophisticated 
bidding strategies. In fact, some players may not even play at equilibrium in the single-shot approximation 
of the GSP auction, or may only be able to find strategies that are approximately utility-maximizing. We 
discuss the robustness of our bounds to the presence of players bidding with limited (or no) rationality. As 
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we shall see, the GSP auction has the property that its social welfare guarantees degrade gracefully when 
our assumptions about the rationality of the players are relaxed. 

1.1 Our results 

Our main result is a bound on the Bayesian price of anarchy for the GSP auction. Specifically, we show 
that the price of anarchy is at most 2.927, meaning that the social welfare in any Bayes-Nash equilibrium 
is at least 1/2.927 of the optimal social welfare. Notice that this is an unconditional bound, as we make no 
assumptions on the distribution on valuation profiles and quality factors (it can, for example, be correlated) 
or on the number of players or slots. 

Perhaps just as important as the bound, however, is the straightforward and robust nature of the GSP 
auction. In particular, our results extend to provide the same welfare guarantees for outcomes of no-regret 
learning: the average social welfare when players play repeatedly in order to minimize total regret, in a 
Bayesian setting, is within a 1/2.927 factor of the optimal social welfare. Also, this bound continues to hold 
even if players have asymmetric access to distributional information, in the form of exogenously provided 
signals. It also degrades gracefully in the presence of approximately rational players or a small fraction of 
irrational players. 

We achieve these bounds by identifying a property that encapsulates many of the insights that drive our 
bounds. Roughgarden [24] identified a class of games that he termed smooth games, defined via a similar 
property that is used to bound the price of anarchy. The smoothness criterion is quite strong, and does not 
apply in general to the GSP mechanism. We identify a weaker property, semi-smoothness, that is satisfied 
by the GSP auction, and is strong enough to also imply price of anarchy bounds. 

We provide improved results for the case where there is no uncertainty, which is the traditional setting 
studied in (H|25l. If valuations and quality factors are fixed, we prove that the social welfare in any pure 
Nash equilibrium is within a factor of 1.282 of the optimal one and show that this bound is essentially tight 
by providing a lower bound of 1.259. Also, we show a bound of 2.310 for coarse correlated equilibria, 
which implies the same bound on the social welfare when players with fixed (effective) types minimize their 
regret in a repeated auction. 

1.2 Related work 

Due to their central role in Internet Monetization, sponsored search auctions have received considerable 
attention in the past years. From the optimization perspective, they were first considered by Mehta et 
al. ll2~Tll . A classical game-theoretical modeling of sponsored search auctions was proposed simultaneously 
by Edelman et al. ill and Varian |[25l . See the surveys of Lahaie et al. lfT5l and Maille et al. ll20l for an 
overview of subsequent developments. 

The model we adopt follows HJ|25l. In those two seminal papers, the authors notice that even though 
truthtelling is not a dominant strategy under GSP, the full information game always has a Nash equilibrium 
that has same allocation and payments as the VCG mechanism - and therefore is efficient. They focus on a 
subclass of Nash equilibria which is called envy -free equilibria in (H and symmetric equilibria in |25ll . They 
show that such equilibria always exist and are always efficient. In this class, an advertiser would not be better 
off after switching bids with the advertiser just above her. Note that this is a stronger requirement than Nash, 
as an advertiser cannot unilaterally switch to a position with higher click-through-rate by simply increasing 
her bid. In dHHHH, informal arguments are presented to justify the selection of envy-free equilibria, but 
no formal game-theoretical analysis is done. We believe it is an important question to go beyond this and 
prove efficiency guarantees for all Nash equilibria. Lahaie lfl4l also considers the problem of bounding 
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the social welfare obtained at equilibrium, but restricts attention to the special case that click-through-rates 
decay exponentially along the slots with a factor of 4. Under this assumption, Lahaie proves a price of 
anarchy of min{ 4, 1 — 4}. 

Gomes and Sweeney iTTTTl study the GSP auction in the Bayesian setting, where player types are drawn 
from independent and identical distributions (without considering the uncertainty due to quality factors). 
They show that, unlike the full information case, there may not exist symmetric or socially optimal equi- 
libria in this model, and obtain sufficient conditions on click-through-rates that guarantee the existence of 
a symmetric and efficient equilibrium. Athey and Nekipelov 0]] study the effect of uncertainty of quality 
factors both from a theoretical and an empirical perspective. 

The study of price of anarchy for non-truthful auction mechanisms (especially in the Bayesian setting) 
was initiated by Christodoulou et al. [6] and developed in Lucier and Borodin fl"8ll , Lucier ifTTl . and most 
recently in the work of Bhawalkar and Roughgarden Q- To the best of our knowledge, this is the first paper 
in which the price of anarchy bounds hold when player valuations are drawn from a correlated distribution. 
In truthful mechanism design, the study of correlated valuations has a long history - see Cremer and McLean 
171 for an early reference. 

The study of regret-minimization goes back to the work of Hannan on repeated two-player games lfl2l . 
Many simple bidding strategies yield low regret, such as Hart and Mas-Collel's regret matching strategy lPT3l 
or the multiplicative weight updating strategy [2]. Blum et al. HI apply regret-minimization to the study of 
inefficiency in repeated games, coining the term "price of total anarchy" for the worst-case ratio between 
the optimal objective value and the average objective value when players minimize regret. 

Roughgarden [24] identifies a class of games that he terms smooth games where the price of anarchy, 
Bayesian price of anarchy, and price of total anarchy are identical. The smoothness criterion is quite strong, 
and does not apply in general to the GSP mechanism. We isolate a property related to smoothness that 
encapsulates many of the insights that drive our bounds. 

Some of the results in this paper appeared in preliminary conference versions. Paes Leme and Tardos 
ll23l study equilibria of GSP auctions and give upper bounds on the price of anarchy in pure, mixed, and 
Bayesian strategies; achieving bounds of 1.618, 4, and 8, respectively. Lucier and Paes Leme [19] and 
Caragiannis et al. improve these bounds and extend them to apply to equilibria with correlated valuations 
and learning outcomes. 

2 Model and Equilibrium Concepts 

We consider an auction with n advertisers and n slotsfl Each advertiser i has a private type V{, representing 
her valuation per click received. The sequence v = (vi , . . . , v n ) is referred to as the type profile (or valuation 
profile). We will write for v excluding the zth entry, so that v = (uj, v_j). 

An outcome is an assignment of advertisers to slots. An outcome can be viewed as a permutation 7r 
with 7r(k) being the advertiser assigned to slot k. The probability of a click depends on the slot as well 
as the advertiser shown in the slot. We use the model of separable click probabilities. We assume slots 
have associated click-through-rates a\ > 02 > ■ ■ ■ > a n , and each advertiser i has a quality factor 7^ that 
reflects the clickability of the ad. When advertiser i is assigned to the A;-th slot, she gets a^i clicks. 

A mechanism for this auction elicits a bid 6j G M + := [0, 00) from each advertiser i, which is interpreted 
as a type declaration, and returns an assignment as well as a price pi per click for each advertiser. If advertiser 
i is assigned to slot j at a price of pi, her utility is a^iiyi —pi), which is the number of clicks received times 

1 We note that we can handle unequal numbers of slots and advertisers by adding virtual slots with click-through-rate zero or 
virtual advertisers with zero valuation per click. 
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profit per click. The social welfare of outcome it is SW(ir, v, 7) = ^ . a^^^v^^y the total value of the 
solution for the participants. The social welfare also depends on the click-through-rates ay, but throughout 
the paper we will assume they are fixed and common knowledge, and as a result we suppress them in the 
notation. The optimal social welfare is OPT(v, 7) = max^ SW(ix, v, 7), the welfare generated by the 
socially efficient outcome. Note that the efficient outcome sorts advertisers by their effective values 7^, 
and assigns them to slots in this order. The effective value can be thought of as the expected value of showing 
the ad in a slot with click-through-rate equal to 1. 

We focus on a particular mechanism, the Generalized Second Price auction, which works as follows. 
Given bid profile b, we define the effective bid of advertiser i to be 7j&j, which is her bid modified by her 
quality factor, analogous to the effective value defined above. The auction sets ir(k) to be the advertiser with 
the kth highest effective bid (breaking ties arbitrarily). That is, the GSP mechanism assigns slots with higher 
click-through-rate to advertisers with higher effective bids. Payments are then set according to critical value: 
the smallest bid that guarantees the advertiser the same slot. When advertiser i is assigned to slot k (that is, 
when ir(k) = i), this critical value is defined as 

77r(fe+l) , 
Pi — »7r(fc+l) 

7i 

where we take b n+ \ = 0. We will write iij(b, 7) for the utility derived by advertiser i from the GSP 
mechanism when advertisers bid according to b: 

«i(b,7) = a 7r -l( i )7i(^i ~Pi) = 0^-1(3) [7;«j - 7 7 r(7r-l(i) + l) fo 7r( 7 r-l(i) + l)]- 

Notice that -k is a function of b, 7 as well. In places where we need to be more explicit, we will write 
7r(b,7, j) to be the advertiser assigned to slot j by GSP when quality factors are 7 and the advertisers bid 
according to b. We will also write a(h, 7, i) for the slot assigned to advertiser i, again when advertisers bid 
according to b and quality factors are 7. In other words, <r(b, 7, •) = 7r -1 (b, 7, •). We write 7r*(b_j,7, j) 
to be the advertiser that would be assigned to slot j if advertiser i did not participate in the auction. When 
b and 7 are clear from the context, we write and a(i) instead of 7r(b, 7, i) and er(b, 7, i). We will also 
write z/(v, 7) for the optimal assignment of slots to advertisers for valuation profile v, so that z/(v, 7, i) is 
the slot that would be allocated to advertiser i in the optimal assignment 

We will consider rational behavior under various models of the information available to the advertisers. 
In general, the advertisers are engaged as players in a game defined by the auction mechanism; each of them 
aims to select a bidding strategy that maximizes her utility. In the following, we use the terms advertiser 
and player interchangeably. We group our models into full information and partial information ones. In all 
models we assume that the values aj are fixed and commonly known to all players. In our full information 
settings, we assume that the quality factors 7, as well as the valuation profile v are also common knowledge. 
In our Bayesian setting of partial information, we assume that the profile of quality factors is unknown to 
all players, and the type v,- L is private knowledge known only to player i, but they are randomly drawn from 
a commonly known joint distribution (F, G) of quality factors and valuation profiles. It will turn out that 
bidding more than one's true type (overbidding) is a dominated strategy in the mechanism we consider. So, 
we will focus on non-overbidding (or conservative) players; see Section 1231 for a discussion. 

2 We note that, since GSP makes the optimal assignment for a given bid declaration, we actually have that ^(v, 7, i) and 
a(v, 7, i) are identically equal. We define v mainly for use when emphasizing the distinction between an efficient assignment for 
a valuation profile and the assignment that results from a given bid profile. 
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2.1 Full information setting 

In the full information setting, the valuation profile v and quality factors 7, are fixed and common knowl- 
edge. We will therefore tend to drop dependencies on 7 from our notation when working in the full infor- 
mation setting. In this setting, a pure strategy for player i is a bid 6, G M.+. We say that the bid profile b is 
a (pure) Nash equilibrium if there is no deviation from which the player can profit, i.e., for all £ R + , 

Ui(bi,b-i) > UiQ/^b-i). 

It is known that a pure Nash equilibrium always exists in this setting (8l|25l. We can therefore define the 
(pure) Price of Anarchy to be 

OPT(v) 

v,3 £ W(7T(b),v) 

where NE is the set of pure Nash equilibria (assuming no overbidding; see Section 1231) . 

2.2 Bayesian setting 

In the Bayesian setting of partial information, we suppose that the valuation profile and the quality factors 
are drawn from a publicly known (possibly correlated) joint distribution (F, G). A strategy for player i is a 
(possibly randomized) mapping bi : M + — > M. + , mapping her type V{ to a bid bi(v{). Notice that a player is 
not able to condition her bid on the quality factors, since they are only known to the search engine, and not 
to the advertisers. 

We write b(v) = (b\ (v\), . . . , b n (v n )) to denote the profile of bids that results when b is applied to type 
profile v. We then say that strategy profile b is a Bayes-Nash equilibrium for distributions F, G if, for all i, 
all V{, and all alternative strategies b[, 

%,- i fi,h[^(h{v i ),\i-. i {x-i),i)\vi] > E v _. |7|b [u i (6 / i (« i ),b_ i (v_ i ),7)|w i ]. 

That is, each player maximizes her expected utility by bidding in accordance with strategy &»(•), assuming 
that the other players bid in accordance with strategies b_j(-), where expectation is taken over the distribu- 
tion of the other players' types conditioned on vi, any randomness in their strategies, and the quality factors. 
We define the Bayes-Nash Price of Anarchy to be 

E v , 7 [OPT(v,7)] 
F,G,b(-)eR!V.E E v ,7,b(v) [<SWO(b(v), 7), v, 7)] 

where BNE is the set of all Bayes-Nash equilibria (again assuming no overbidding; see below). 

2.3 No overbidding 

It is important to note that, in both the full information and Bayesian settings, any bid fej > Vi is dominated 
by the bid bi = Vi in the GSP auction. If by bidding bi > Vi, the next highest effective bid is greater than 
jiVi, then the player gets negative utility. If on the other hand, the next highest effective bid is smaller or 
equal than 7^, then bidding bi = Vi would get the same slot and payment. Based on this, we make the 
following assumption for the rest of the paper: 

Assumption: Players are conservative and do not employ overbidding strategies in GSP auctions. This 
means that for pure strategies bi < Vi, for mixed strategies P(6j > Vi) = 0, and for Bayesian strategies 
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P(&i(«i) >Vi) = Oforallu i . 



We use this assumption to rule out unnatural equilibria in which advertisers apply dominated strategies. 
We remark that, in these equilibria, the social welfare may be arbitrarily worse than the optimal. It is 
therefore necessary to exclude dominated strategies in order to obtain meaningful bounds on the price of 
anarchy. We note, however, that this phenomenon is not specific to the GSP auction: such degenerate 
equilibria exist even in the Vickrey auction for a single good, where truthful bidding is a weakly dominant 
strategy. Since the Vickrey auction is a special case of GSP auctions (where one slot has a± = 1, all other 
slots have ol{ = and all quality factors have ji = 1), this issue carries over to our setting. Consider the 
example of a single-item Vickrey auction, where truthful bidding of bi = vi is a weakly dominant strategy. 
Yet with overbidding, there are equilibria where an arbitrary player with low valuation bids excessively high 
(and hence wins), while everyone else bids 0. Note, however, that this Nash equilibrium seems very artificial 
as it depends crucially on the low valuation player using the dominated strategy of overbidding. Indeed, such 
an advertiser is exposed to the risk of negative utility (if some other advertiser submits a new bid between 
her valuation and bid) without any benefit. We therefore take the position that advertisers will avoid such 
dominated strategies when participating in the GSP auction. 

2.4 Signals and information asymmetry 

We define an extension of the setting above, incorporating a Bayesian version of information asymmetry. In 
this model, each player's type consists of a signal Sj drawn from an arbitrary signal space S. The signal of 
player i includes her valuation Vi(si) and can contain other privately-gained insight that refines the player 
z's conditional distribution over the space of other players' types and quality factors. The signals and quality 
factors come from a publicly known joint distribution (F', G), which can be arbitrarily correlated. 

In this model, a strategy is a bidding function that maps Sj, a signal, to a distribution of possible bids. 
The bid profile b is a Bayes-Nash equilibrium in the asymmetric information model if, for all i and all 
alternative bidding functions &/, 

E 8 _ ii7i b[tti(&i(si),b_i(s_i),7)|si] > E s _ ii7ib [it i (6<(s i ),b_i(s_i),7)|si] 
In this model, the Price of Anarchy is defined as 

E s , 7 [OPr(v(s), 7 )] 

sup 

F',G,b(-)eB7VE E s, 7 ,b(s) [SW(vr(b(s), 7), v(s), 7)] 

where BNE is the set of Bayes-Nash equilibria with respect to distribution F' over signals, with no overbid- 
ding. 

The presence of signals captures the notion that some advertisers might have a better potential to infer 
the other advertisers' valuations than others, or may be endowed with privileged information. We do note, 
however, that players do know their own valuations Uj(sj), and also are aware of the profile of bidding 
strategies b(-) and the distribution F', so that players can rationalize about the effects of signals upon the 
bidding behavior of their opponents. 

2.5 Repeated auctions and regret minimization 

We now consider the GSP auction in a repeated-game setting. In this model, the GSP auction is run T > 1 
times. We will distinguish between two variants of this model: the full information model and the model 
with uncertainty. 



9 



Full information model. Each round of the GSP auction occurs with the same slots and players. The 
private valuation profile v of the players and the quality factors do not change between rounds, but the 
players are free to change their bids. We write b\ for the bid of player % on round t. We refer to a D = 
(b 1 , . . . , b T , . . .) as an (infinite) declaration sequence. Given declaration sequence D, we will write D T 
to mean the prefix of D of length T; that is, D T = (b 1 , . . . , b T ). Given a (finite or infinite) declaration 
sequence D, we will write ir(D) for the sequence of permutations generated by GSP on input D. The 
average social welfare generated by GSP on a finite input sequence D T of length T is SW(tt(D t ), v) = 
Y J2t=i SW(7r(b*), v). The average social welfare generated by GSP on an infinite input sequence D is 
then defined to be SW(ir(D), v) = liminf T ^oo SW(ir(D T ), v). 

The full range of equilibria in such a repeated game is very rich, so we restrict ourselves to a particular 
non-equilibrium form of play that nevertheless captures the intuition that players learn appropriate bidding 
strategies over time, without necessitating convergence to a stationary equilibrium. 

We say that declaration sequence D = (b 1 , . . . , b T , . . .) minimizes external regret for player i if, for 
any fixed declaration b[, 

t<T t<T 

where R{T)/T — > as T grows large. That is, as T grows large, the utility of player i in the limit is 
no worse than the utility of the optimal fixed strategy in hindsight. The Price of Total Anarchy |4[ is the 
worst-case ratio between optimal social welfare and the average social welfare obtained by GSP across all 
declaration sequences that minimize external regret for all players. That is, the price of total anarchy is 

OPT(v) 

where the supremum is taken over (infinite) declaration sequences that minimize external regret for all 
players. 

It is known that the price of total anarchy is closely related to an equilibrium notion for the single-shot 
game known as coarse correlated equilibrium. We discuss this relationship in Section [5] 

Learning with uncertainty. Next we describe our model of learning in repeated GSP auctions with un- 
certainty. In this model, each round of the GSP auction occurs with the same slots, but the valuation profile 
v and quality factors 7 are redrawn from (F, G) on each roundel These changes to ad quality and types can 
be thought of as being due to the context of the search query that initiates each auction instance, which can 
change between rounds. 

Suppose that each player has a finite type spacd3- Let v* , 7* be the type profile and quality factors drawn 
at round t. Given a declaration sequence D and type Vi for player i, we denote by Vi) the subsequence of 
D consisting of the set of rounds in which player i has type Dj, i.e., Vi) = {t; v\ = vi}. Define I T (i, Vi) 
analogously with respect to D T . Given a sequence of type profiles and quality factors that represent the 
realization of these random quantities over the rounds of the auction, we say that player i has vanishing 
regret in declaration sequence D if player i has vanishing regret (in the sense of the full information game) 
on the subsequence I(i, V{) of D for each possible type V{. Formally: 

uMM-i,^)^ £ u^,b^,7<) + i*(|i T Mi)|) 

tei T (i,vi) tei T (i,vi) 

3 In fact, we can also think of the set of players as changing on each round: if player i is assigned type on a given round, this 
can be interpreted as player i not being present in that round. 

4 For instance, one could assume that valuations are bounded and multiples of some arbitrarily small increment. 
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for R(T)/T — > as T — > oo. Notice that since v* and v* are independent and identically distributed 
for each round, then \I T (i,Vi)\ — > oo as T — > oo. Now, we can define the Price of Total Anarchy with 
uncertainty as: 

sup hm sup = . 

{v*},{ 7 '},r> r^oo Lt<T 5 ^( 7r ( b t ,T)>v t >r) 

As in the full information setting, there is a relationship between regret minimization under uncertainty 
and coarse correlated equilibria with uncertainty. We defer a discussion of this relationship to Section [5] 



3 Price of Anarchy with Uncertainty 

Our main result is a bound on the price of anarchy for the Generalized Second Price auction with uncertainty. 
Recall that our model captures two types of uncertainty: uncertainty for player types and uncertainty about 
quality factors. Further, our result holds even in the presence of information asymmetry in the form of 
personalized signals available to the playersj^] For simplicity of presentation, we focus on the setting where 
there are no signals and player valuations and quality factors are drawn from a known joint distribution 
(F,G). 

Theorem 3.1 The price of anarchy of the Generalized Second Price auction with uncertainty is at most 
2.927. That is, for any fixed click-through-rates a±, . . . , a n , any joint distribution (F, G) over valuation 
profiles and quality factors, and any Bayes-Nash equilibrium h, 

Ev, 7 ,b[W(7T(b, 7 ),v, 7 )] > ^E V)7 [OPT(v, 7 )]. 

We will present a slightly weaker version of the above theorem, where we prove a bound of 3.164. The 
proof of the improved bound of 2.927, which is more technically involved, can be found in Appendix iBl 

Our proof is based on an extension of a proof technique introduced by Roughgarden 11241 . which he calls 
smoothness. We begin by reviewing this notion briefly in the context of a general game. Let t denote the 
(fixed) player types in a game, and h a pure strategy profile for the players, and let U{ (t, h) denote the utility 
of player i with player types t, and strategy profile h. Let sw(t, h) denote the social welfare generated by 
strategy profile h, and sw*(t) the maximum possible social welfare. Roughgarden defines (A, //)-smooth 
games as games where for all pairs of pure strategy profiles h, h', and any (fixed) vector of types t, we have 

^2 Ui(t, h' { , h_i) > A • sw(t, h')-n- sw(t, h). 

i 

Roughly speaking, smoothness captures the property that if strategy profile h' results in a significantly 
larger social welfare than another strategy profile h, then a large part of this gap in welfare is captured by 
the marginal increases in the utility of each individual player when unilaterally switching her strategy from 
hi to h^. 

It is not hard to see that GSP is not a smooth game (see Appendix IA1 for an example). However, we 
wish to argue that GSP does retain some of the properties of smooth games. We define a conceptually 
weaker but related property, and show that GSP satisfies this weaker property: there is a particular (possibly 

5 In the presence of additional signals, we can assume that signal s also encodes the valuation of the player, i.e., that player i's 
valuation for a click when she receives signal Si is Vi(si), and in this case, signals and quality factors are drawn from a known joint 
distribution (F', G). Our statement and proof carry over to this case with straightforward modifications. 
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randomized) strategy profile h' for which a smoothness-like inequality holds for any other pure strategy 
profile h. In other words, we prove the existence of a single bidding profile h' (depending on the types) 
that can be used by players unilaterally to improve the efficiency of GSP whenever its allocation is highly 
inefficient. 



Definition 3.2 (Semi-Smooth Games) We say that a game is (A, [i)-semi-smooth if for each player i there 
exists some (possibly randomized) strategy /i'j(-) (depending only on the type of the player) such that, 

Y^^h^Pi^h'iiti)^)} >\-SW*(t)-fl-8w(t,h), 

i 

for every pure strategy profile h and every (fixed) type vector t. The expectation is taken over the random 
bits ofh'^ti). 

Analogous to Roughgarden's |[24l proof, semi-smoothness also immediately implies a bound on the 
price of anarchy with uncertainty. 

Lemma 3.3 If a game is (A, p) -semi-smooth and its social welfare is at least the sum of the players' utilities, 
then the price of anarchy with uncertainty (and information asymmetries) is at most (fx + 1)/A. 



Proof. Consider a game in the Bayesian setting where player types are drawn from a joint probability 
distribution and let h be a Bayes-Nash equilibrium for this game. By the definition of the Bayes-Nash 
equilibrium, we have that Et_i,h[£^(t, h)|ij] > ¥<t_ u \ l \Ui(t 7 h' i (ti),h.-i)\ti\ for every value the random 
variable h'^U) may take. Hence, E t _ i>h [Ui(t, h)|fj] > E t _. )h E^( t .)[C/i(t, bJ^U), h_»)|£i]. Now taking 
expectation over t{, we get E ti h[^(t, h)] > E t ^h'.(ti)[Ui(t, h'^U), h_j)]. By summing over all players, 
and using the fact that the social welfare is at least the sum of the players' utilities, as well as the semi- 
smoothness property, we have 

E t)h [s^(t,h)] > E tjh [^[/ 4 (t,h)] 

i 

> E tl hEEfc{( ti) [yi(t J /ii(<i) > h_ i )]] 

i 

> Et,h[A-aio*(t)-/i-siw(t,h)] 
= XE t [sw*(t)}- fiK t)h [sw(t,h)}. 

Note that the third inequality follows by applying the semi-smoothness property for every fixed type vector 
and every pure strategy profile that are simultaneous outcomes of the random vectors t and h. The last 
inequality implies E t [sw;*(t)] < ^-E t h[su;(t, h)], as claimed. 

We remark that the proof holds without significant changes if we add information asymmetries in the 
game, i.e., if we assume that each player gets signals that reveal her type and refine her knowledge on the 
probability distributions of the types of the other players. All we need to change in this case is to replace the 
expectations over types with expectations over signals. ■ 

Notice that the usefulness of Lemma [331 lies in the fact that it can provide bounds on the efficiency loss 
for Bayesian games (and, as we will see later in Section [5J under even more general equilibrium concepts) 
by examining substantially more restricted settings. In the context of GSP auction games, it allows us to 
focus on identifying a (possibly randomized) deviating bid strategy for each player (i.e., a bid b\ for each 
player i) so that the semi-smoothness inequality holds for every fixed valuation vector v and pure bidding 
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profile b. By Lemma 1331 this then immediately implies a bound on the price of anarchy of GSP auction 
games with uncertainty and information asymmetries. 

We note that, technically speaking, the GSP auction does not immediately fit into the framework of 
semi-smoothness: advertiser payoffs depend on random quality factors which may be correlated with the 
type profile. However, this notational technicality is easily addressed by expressing advertiser utilities in 
expectation over quality scores. That is, expressing utilities in the GSP auction in the notation of general 
games, we have £7j(v, b) = E 7 [u,,(b, 7)|v]. Since quality factors affect the social welfare as well, we have 
sro*(v) =E 7 [OPT(v,7)|v] ands™(v,b) = E 7 [SVF(^(b, 7), v, 7 )|v]. 

We are ready to prove that GSP auction games are semi-smooth. 

Lemma 3.4 The GSP auction game is (1 — -, 1) -semi-smooth. 

Proof. We begin by rewriting the definition of semi-smoothness in the notation of GSP auctions. The GSP 
auction game is (1 — -, 1) -semi-smooth if and only if, for each valuation profile v, there exists a bid profile 
b' (with b' i depending only on the valuation of player i) such that, for every bid profile b, 

b_;, 7 )|v] > M-^E 7 [OPT(v,7)|v]-E 7 [W(7r(b,7),v, 7 )|v]. (1) 

We will actually establish the stronger inequality that, for all 7, 

J2uM,*>-i,7) > (l - £)0PT(v, 7) -W(vr(b, 7), v, 7). (2) 

The desired inequality CD) will then follow by taking (O in expectation over the choice of 7 (whose distri- 
bution may depend on the valuation profile v). 

Before establishing inequality ©, we will prove the slightly weaker statement that the GSP auction game 
is (1/2, 1) -semi-smooth (which implies a bound of 4 on the price of anarchy with uncertainty). Choose a 
vector v of fixed valuations, a pure bidding profile b, and quality factors 7. Consider a (deterministic) 
deviating bid b\ = Vi/2 for each player i. We distinguish between two cases (recalling that u(i) is the slot 
assigned to player i in the efficient allocation given v and 7): 

• If by bidding b\ player i gets slot or better, then ttj(i^, b_j, 7) > atuCiWiVi/^, as the payment pi 
cannot exceed her effective bid. 

• If by bidding b' ri player i gets a slot lower than v(i), then the effective value of the player Tv(u(i)) in 
slot v(i) is at least 7^/2, as we assume no overbidding. 

We conclude that, in either case, 

^(6-,b_i,7) > a u ^iVi/2 - 77r(i/(t)) 

Summing over all players, and noticing that J2i a nn{i) v ir(i) = SW(ir(h,j),v,~f), while J2i a u{i)7i v i = 
OPT(v, 7), we arrive at the claimed bound that the GSP auction game is (1/2, 1) -semi-smooth: 

J] mM, b_,, 7) > \0PT{v, 7) - SW(ir(b, 7), v, 7). 
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To improve the bound to (1 — - , 1) we consider a randomized bid b' rather than the deterministic bid of 
Vi/2 considered above. Bid b\ is arandom variable on [0, Uj] with density f(y) = for y £ [0, i>j(l — j)\ 
and f(y) = otherwise. We will show that 



E b ,M^b_ i57 )]> fl-i) 



(3) 



By summing expression ® for all i and using the fact that b n u) < v^u\ by the non-overbidding assumption, 
we obtain that the game is (1 — 1) -semi-smooth. 
It remains to derive equation 0]). We have that 



E 6 / b_j, 7)] > E 6 /[a ! ,( i )7j(^ - 6-)l{7i6- > 7ttO(0) }] 



/ 

.70 



a 



u(i)ii(vi - y)Hny > in(u(i))K(u(i))} 



1 



Vi-y 



-dy 



Vi 1 



7< 



> 1 



which implies ©, completing the proof of Lemma [341 

Combining Lemmas [3.31 and [3~4l we get the claimed bound on the price of anarchy. 



Theorem 3.5 The price of anarchy of the Generalized Second Price auction with uncertainty ( and with 
information asymmetries) is at most 2(1 — l/e)" 1 ~ 3.164. 



4 Pure Nash Equilibria in the Full Information Setting 

In this section we turn our attention to the full information setting, where the quality factors 7 are fixed and 
common knowledge. Without loss of generality we can assume that 71^1 > 72^2 > • • • > 7n^n- hi this 
setting the strategy of a player is a single bid 6j € [0, vi\, again assuming that players do not overbid. Our 
main result in this setting is the following: 

Theorem 4.1 The (pure) price of anarchy of the Generalized Second Price auction in the full information 
setting is at most 1.282. In other words, for any fixed click-through-rates a, valuation profile v, and quality 
factors 7, if h is a bid profile in pure Nash equilibrium, then SW(ir(h), v) > Y582 ' OPT(v) 0.78 • 
OPT(v). 

The bound above is very close to being tight, since we can exhibit an example with 3 players and 3 slots 
for which there is an equilibrium where the gap between the optimal social welfare and the social welfare in 
equilibrium is 1.259. Also, we can show the following slightly stronger bound for a small number of players 
and slots. Notice however that the bound in Theorem 14. 1 1 holds regardless of the number of slots. 

Theorem 4.2 For 2 players and 2 slots, the price of anarchy is exactly 1.25. For 3 players and 3 slots, the 
price of anarchy is exactly 1.259. By exactly we mean that there is a particular GSP auction game with an 
equilibrium matching this bound. 
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Proof. Here we give an example with two slots that yields price of anarchy 1.25. In Appendix we show 
that this is worst possible, and show the bound for 3 slots. 

For two slots, consider an example with two players with valuations 1 and 1/2 respectively, quality 
factors 71 = 72 = 1, and two slots with a\ = 1 and CK2 = 1/2. The bids b\ = and 62 = 1/2 are at 
equilibrium, resulting in a social welfare of 1, while the optimal social welfare is 1.25. ■ 

The full proof of Theorem 14.11 can be found in Appendix [Cj Here instead, we present the proof of a 
weaker bound that highlights the intuition underlying our result that GSP equilibria have good social welfare 
properties. 

Theorem 4.3 The (pure) price of anarchy of the Generalized Second Price auction in the full information 
setting is at most 2. 

The proof is based on the concept of weakly feasible allocations. Recall that each bid profile b defines 
an allocation ir that is a mapping from slots to players ir : [n] — > [n]. 

Definition 4.4 (Weakly feasible allocations) We say that an allocation ir is weakly feasible if the following 
holds for each pair i, j of slots: 

aj_ + 7ir(Q*MQ > L (4) 
We also use the term weak feasibility condition to refer to inequality ©. 

Lemma 4.5 If h is a Nash equilibrium of the GSP auction game, then the induced allocation ir satisfies the 
weak feasibility condition. 

Proof. If j < i the inequality is obviously true. Otherwise consider the player ir(j) in slot j. Since b is a 
Nash equilibrium, the player in slot j is happy with her outcome and does not want to increase her bid to take 
slot i, so: ajij^v^j) - 7 7r(i+1) 6 7r(i+1) ) > 0^(7,^)^(7) - T^i)^;)) since 6,^+1) > and b w{i) < 
then: ajj^v^ > a^j^v^ - 7 7r(i )U 7r (,)). ■ 

The concept of weakly feasible allocations encapsulates the fact that an allocation in equilibrium cannot 
be too far from the optimal. The optimal allocation is such that = i, since both {ai\ and {7^} are 
sorted. If an allocation is not optimal, then two slots i < j have advertisers assigned to them such that 
ir(i) > 7r(j), i.e., they are assigned in the wrong order. Equation (0]) implies that at least one of the two 
ratios is at least 1/2, and hence whenever advertisers are assigned in the non-optimal order, then either (i) 
the two advertisers have similar effective values for a click, or (ii) the click-through-rates of the two slots 
are not very different; in either case their relative order does not affect the social welfare very much. 

Given Lemma 1431 the proof of Theorem 14.31 follows almost directly: 

Proof of Theorem 14.31 Taking j = a(i) in the definition of weakly feasible allocations, we get that: 

^a(i)li v i + a iln(i) v Tr(i) > Oi7i^i- Now, summing this for each player i, we get 

2 • SW(7r(b), v) = ^2 a <r(i)H v i + ^2 a i77r(i)%(i) > ^2 ai ^ iVi = OPT(v). 

i i i 
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5 Quality of Learning Outcomes in GSP 



In this section, we bound the average quality of outcomes in a repeated play of a GSP auction game where 
players employ strategies that guarantee no external regret. In both the full information setting and the setting 
with uncertainty, we can reduce the problem over declaration sequences to a problem over distributions. This 
will allow us to adapt our earlier bounds on the price of anarchy from Sections [3] and [4] to bound the price 
of total anarchy. As in previous sections, we show simple and intuitive bounds in this section, and defer 
improved and more complex bounds to the appendix. 

5.1 Learning in the full information setting 

We will first focus upon the full information setting of the GSP auction. Recall that, in this model, the 
valuation profile v and quality factors 7 are fixed and common knowledge. As in the previous section, we 
will assume that 71 «i > 72^2 > • • • > 1 n v n - 

We will begin by proving a relationship between the price of total anarchy and the set of coarse corre- 
lated equilibria for the GSP auction in the full information model. Given a valuation profile v, a distribution 
D over bid profiles is called a coarse correlated equilibrium if 

Eb~D[«i(b)] > Eb^D^^b-;)],^. 

As we shall show, the price of total anarchy can be bounded by considering the social welfare generated at 
any coarse correlated equilibrium. 

Lemma 5.1 The price of total anarchy in the full information setting is at most 

OPT(v) 
vJSce E b „ D [SW(7r(b),v)] 

where CCE is the set of coarse correlated equilibria. 

Proof. Consider a declaration sequence D = (b 1 , . . . , b*, . . .) in the full information case. For each T let 
D T be the distribution over bid profiles where each b* for t < T is drawn with probability ^. Proving that 
the price of total anarchy is bounded by r\ is equivalent to showing that: 

liminfE b ^ D T[W(7r(b),v)] > -OPT(v). 

T 7] 

Since the set of all possible bid profiles is compact, one needs to prove that for all distributions D such that 
there is a subsequence of {D T }^ converging in distribution to D we have: 

E b „ D [SW(7r(b),v)] > -OPT(v). 

V 

It is therefore sufficient to show that such a D is a coarse correlated equilibrium. We note that the fact that 
the declaration sequence D minimizes external regret implies that, for each distribution D which can be 
written as the limit of a subsequence of {D t }t, it holds that 

Eb~DMb)] > Eb^DM^b-;)],^ 

as required. ■ 

Using this connection to coarse correlated equilibria, we are able to obtain a bound of 2.310 on the price 
of total anarchy of the GSP auction. 
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Theorem 5.2 The price of total anarchy of the Generalized Second Price auction in the full information 
setting is at most 2.310. 

A full proof of Theorem 15.21 appears in Appendix [D] We now present a simpler proof of the following 
weaker bound, which captures some of the intuition behind the proof of Theorem 15.21 

Theorem 5.3 The price of total anarchy of the Generalized Second Price auction in the full information 
setting is at most 3. 

Proof. The proof can be thought of as an improved version of the price of anarchy bound based on the 
fact that the GSP auction game is (1/2, 1) -semi-smooth. We consider a distribution D which corresponds 
to a coarse correlated equilibrium. All expectations in the following are taken with respect to b ~ D. 
Recall the outline of the bound of 4 on the price of anarchy based on the fact that the GSP auction game is 
(1/2, 1) -semi-smooth. We considered a possible deviation for player i with valuation Vi to bid b\ = Vi/2, 
and concluded the bound iij(^,b_j) > a^iVi/2 — oti^n^b^u^ in the proof of Lemma 1341 We use the 
no-regret inequality directly, to get that 

E[ui(b)] > mjiVi/2 - E[a i 7 n -( i )6 7r ( i )]. 

Using that b^u) < v^u), an d summing over all players we get a bound of 4 on the price of total anarchy as 
was done in Lemma [331 

Here we improve this bound by adding two new ideas. First, note that for all slots except the top one 
a iln(i)b-n(i) is a lower bound to the payment of the player in slot i — 1. The social welfare is the sum of 
player utilities and the payments. The inequality states that in expectation the utility of player i plus the 
payment of the player in slot i — 1 is at least dijiVi/2, i.e., half of the social welfare contributed by player i 
in the efficient solution. To turn this into a bound on social welfare, we need to handle player 1 differently, 
as ai7 7r n)6 7r m does not correspond to any payment. 

The second observation is that for player 1 we can obtain a stronger bound on her utility by considering 
the deviation b[ = v\ . For other players such a high bid would endanger them to get a slot much higher than 
their slot in the optimum at a very high price. But player 1 already gets the best slot in the efficient solution. 
Deviating to b\ = v\ will give the player the top slot, and hence utility ai7i«i — (Xi^mb^ny Now using 
the no-regret property we get 

E[«i(b)] > ai7iui - Ejan^!)^)]. 

By summing over all players and writing the social welfare as the sum of utilities plus the total payments, 
we get: 

E[W(vr(b),v)] =E[J> i (b)] lEgaiKi+D^i)] 

i i 

> ^EMb)] + ^E[n i (b)] +Eg> i77r(i) & 7rW ] 

i>2 i>2 

ai7i«i E[ai7 7r(1) 6 7r(1) ] aaiVi 

> — 2 2 ^ ~ 2^ E [ Q!i 'M*) fe *(0] + Z^^<*Mi) b *(W 

i>2 i>2 i>2 

= ^OPT(v) - iE[ai 7w( i)6 ff (i)]. 
Since E[SW(ir(b), v)] > E[ai7 7r(1) 6 7r(1) ], we obtain that E[5W / (vr(b), v)] > \OPT(y). ■ 
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5.2 Learning with uncertainty 

Let us now turn to the model of learning outcomes with uncertainty. As in the full information model, we can 
define a Bayesian version of the coarse correlated equilibrium. A Bayesian coarse correlated equilibrium is 
a mapping from valuation profiles v to distributions over bid profiles D(v) such that 

E v, 7 ,b~D(v) [ui (b, 7) \vi] > E Vi7ib ~D(v) K (K > h ~i > 7) I Vi] , Vi, v { , b ■ . 

Similarly to Lemma 1570 we can show that the price of total anarchy with uncertainty can be bounded by 
considering the social welfare generated at any Bayesian coarse correlated equilibrium. 

Lemma 5.4 The price of total anarchy with uncertainty is at most 

E v , 7 opr(v, 7 ) 

sup 

F,G,D(.)eCC£: E v, 7 ,b~D(v) [SW (ir(b) , v , 7)] 

where CCE is the set of Bayesian coarse correlated equilibria. 

The arguments in the proof of Lemma [331 can be used with essentially no change to show that (A, //)- 
semi-smoothness implies a bound of (/i + 1)/A to the price of total anarchy with uncertainty. From this, we 
know that: 

Theorem 5.5 The price of total anarchy of the Generalized Second Price auction with uncertainty is bounded 
by 3.164. 

In Appendix El we give the following improved bound. 

Theorem 5.6 The price of total anarchy of the Generalized Second Price auction with uncertainty is bounded 
by 2.927. 
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A GSP auction games are not smooth 

Theorem A.l The GSP auction game is not (A, fj,) -smooth for any parameters A, //. 

Proof. Consider a GSP auction game with 2 slots with click-through-rates 1 and a < 1, and two advertisers 
with valuations 1 and v. Let s = (61,62) and s* = (63,64) where 1 > v > 62 > 63 > 64 > b\ > 0. For 
this case, the expression J2i u i{ s i-, s ~i) > XSW(s*) — p J SW(s) (according to Roughgarden's definition of 
smoothness [24]]) becomes: 

a(l - 0) + l(v - 61) > A(l + av) - fi(v + a), 

which yields 

(l + fi)(a + v) > X(l + av), 

for 61 > 0. Given any A, p, > 0, there exist arbitrarily small a and v so that the above inequality is violated; 
hence, the GSP auction game is not (A, /i)-smooth. ■ 

B Improved Bounds for Games with Uncertainty 

In this section, we prove Theorems 13. Il andl 5.6l The idea of the proof is analogous to our proof of the bound 
of 3.164 in Section[3l based on semi-smoothness, but will use a modification of semi-smoothness specially 
tailored to the GSP auction game, analogous to the way we modified the simple bound of 4 derived using 
the (1/2, l)-semi-smoothness of GSP to a bound of 3 on the price of total anarchy for the full information 
case in Section [5] We handle the case when a player has the highest effective value separately, and show 
that there exists a bidding profile b' such that the following inequality holds. 

ElJ^uMiv^h-i,^] > /3E[OPT(v, 7 )] - (1 + 5) £ E[a i77r(i) 6 7r(i) ] + E[ai 77r(1) 6 7r(1) ]. (5) 

i i 

This inequality is analogous but weaker than claiming that GSP is (/3, 5) -semi-smooth, yet we will show 
that it implies that the price of anarchy (and the price of total anarchy) is bounded by ^Mp. This connection 
is stated in the next lemma. 
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Lemma B.l Assume that for every GSP auction game there is a bidding profile b' and parameters (3,5 > 
such that inequality (0 holds for any strategy profile b. Then, the price of anarchy of the Generalized 
Second Price auction with uncertainty is at most ^Mp. The same bound applies to the price of total anarchy 
with uncertainty as well. 

Proof. Consider a Nash equilibrium bidding profile b. Clearly, E[«j(b, 7)] > E[«i b_j, 7)] by 
selecting the bidding profile b' as in inequality ©. We use this inequality and the fact that the social 
welfare is the sum of the expected utilities of the advertisers plus the total payments to get 

E[5W(7r(b(v), 7), v, 7)] = u ^ 7)] + E E ^7^+1)^+1)] 

i i 

i i 

> (3E[OPT(v, 7)] -(1 + 5)J2 na lMi) b n{l) ] + E[ai 7w(1) 6 w(1) ] 

i 

i>2 

= /3E[OPT(v, 7)] - dJ2 na lMi) b n{i) ] 

i 

> /3E[OPT(v, 7 )] - <5E[5^(7r(b(v), 7 ),v,7)], 

which implies that the price of anarchy is at most ^Mp, as desired. To get the same bound for the price of 
total anarchy, consider a coarse correlated equilibrium b instead of a Nash equilibrium. ■ 

The next lemma (Lemma [B.3l ) connects inequality © to the existence of functions with particular prop- 
erties which we call (/?, <5)-bounded functions. 

Definition B. 2 Let (3,6 > and g : [0,1] — > M+. Function g is {(3, 5) -bounded if the following three 
properties hold: 

i) [ g(y)dy < 1, 







ii) (l-z) J g(y)dy>(3-5z, V*e[0,l], 
Hi) f (l-y)g( y )dy> (3-(l + 5)z, VzG[0,l]. 

J z 

Recall that the proof of Lemma [3~4l that GSP is (1 — — , 1) -semi-smooth relied on a random distribution 
with density f(y) = for y G [0, (1 — ^)] and f(y) = otherwise, and considered the bid distribution 
b'i = yvi for player i with valuation v\. The improved proof in Lemma lB3l uses a {(3, (5)-bounded function 
g in place of this /. 

Lemma B.3 Let (3,5 > be such that a ((3, 5)-bounded function exists. Then, there is a bidding profile h' 
such that inequality (T5]) holds for any strategy profile b. 
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Proof. In the proof we consider a GSP auction game with n slots with click-through-rates a\ > a% > . . . > 
OL n > and n conservative players with random valuations v\ , , ■ ■ ■ , v n > and random quality factors 
7i> 72) • • • > 7n > 1- Let b denote any bid profile. Also, we denote by b' the bid profile such that b\ (x) is the 
most profitable deviation for player i when her valuation is v; t = x. We will prove inequality © using this 
definition for b'. 

The proof is long and technical. Before presenting it, we give a high-level overview. We apply the 
following three steps: 

• Step 1 : We focus on advertiser i with valuation v i = x and obtain a lower bound on her expected utility 
E[nj(6^, b_j, j)\vi = x] when deviating to b'^x). The main idea we use here is that the deviation to 
bid b'iix) is more profitable for advertiser i than deviating to the bid yx, for every y S [0, 1]. This 
yields infinitely many lower bounds on E[ttj(^, b_«, ^)\vi = x]; we combine them in a single lower 
bound by taking their weighted average, with weights indicated by the values of a (/?, 5) -bounded 
function g. 

• Step 2: We further refine the lower bound on E [m (b^ , b_ j , 7) | v i = x] . Here, we reason about the slots 
advertiser i would occupy by deviating to bid yx and the utility she would then have, and we use the 
properties of (/?, <5)-bounded functions. We consider slot 1 and slots i > 2 separately, as we did in the 
proof of Theorem [531 

• Step 3: We use the bound obtained in Step 2 in order to compute a lower bound for the total expected 
utility of all players when deviating to b'. We first lower-bound the unconditional expected utility of 
advertiser i and, then, we simply sum the obtained inequalities over all advertisers in order to obtain 
inequality ©. 



Step 1: Focus on player i and let x be a possible valuation for this player. Let (3,5 > and consider a 
(/?, <5)-bounded function g : [0, 1] — > R+. Using the first property in Definition IB.2l for g and the fact that 
is the most profitable deviation for advertiser i, we have 

E[u i (b' i (x),b-i,j)\v i = x] > / g(y)E[ui(bi(x),b-i,j)\vi = x]dy 



> / 9(y)^[ui(yx,b-i,^)\vi=x]dy. 
Jo 

Given any slot j, let Ax denote the event that V{ = x and v(i) = j and B l x ° denote the event that v(i) = j 
given that Vi = x. Using these definitions, we can rewrite the quantity E[ui(yx, b_j, 7)1^ = x] for every 
y G [0, 1] as 



E[u i (yx,b- i , 1 )\v i = x] = ^EluiC^b-i^JI^'l-PtH 
By the last two (in)equalities, we obtain that 



„i n 

E[ Ui {b' i {x\b- hl )\v i =x\ > / 5 (y)V;E[« i (yx,b_ i ,7)|4 j ]-W]dy 

Jo j=1 

n „i 

= V/ g(y)E[u l (yx,b. ul )\A^]dyF[B l J]. (6) 
~T J 
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Step 2: Our puipose now is to refine the lower bound provided by inequality ©. Let 7r'(b_j, i) be the 
player with the 2-th highest effective bid in b_j. 

First consider slot 1 separately. Assume that the event A^ is true, i.e., V{ = x and v{i) = 1. We 
will first lower-bound the quantity E[ui(yx,b_j,7)|^ :] for every y G [0,1]. By deviating to bid yx, 
player i is allocated the first slot whenever ^yx > 7jr i m^m; in this case, player i has utility at least 
ai(%x - 7^(1)^(1))- Hence, 

E[ui(yx,b_i,~f)\A^] > E[ai(7iX - 7 7r * (1) & 7rI(1) )l{7 i yx > 7^(i)^(i)}l^ 1 ]- 
We set 2; = 2z!W ZLlffl anc [ use this last inequality to obtain 



! g(y) • E[ai 7i x(l - z)l{y > z}\A^} dy 
Jo 

E[amx(l - z) [ g(y)t{y> z}dy\A*} 



1 

g(y)E[u i (yx,b- i ,j)\A^}dy > 



= K[a lllX (l-z) J g(y)dy\A£] 

> E[ai(/3 7i x- ^(i)^(i)) I4 1 ] (7) 

where the second inequality follows by the second property of Definition IB . 21 for function g (and using the 
definition of z). 

Now, assume that the event Ai is true for j > 2, i.e., Vi = x and v(i) = j. We will lower-bound the 
quantity E[ui(yx, b_j, 7 )|>la ] for every y £ [0, 1]. By deviating to bid yx, player i is allocated slot j (or a 
higher one) whenever ^iyx > l^tj^^iij), * n tn i s case > player i has utility at least a^ix{l — y). Hence, 

E[ui(yx,b- h j)\A%] > E[a j7i x(l - y)t{-fiyx > -f nl{j) b nl{j) }\A^]. 
We set z = 77rI(j) n * (j) and use this last inequality to obtain 



7;Z 
1 



dy > ( g(y)E[a j7i x(l-y)t{y> z}\A^]dy 
Jo 



g{y)E[u i {yx,h- i ^)\A % i\ 

= E[ ajli x J {I- y)g(y) dy\A%] 

> E[ aj (Pnx - (1 + d) l7ri(j) b^ U) ) \A l i\. (8) 

The second inequality follows by the third property of Definition lB.2l for function g (and using the definition 
of z). 

We now use inequality © together with the lower bounds for J g(y)E[ui(yx, b_j, 7)!^'] dy obtained 
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in © and ([U). We have 

E^O^b-^K = x] > E[ai (/3 7l x - ^(i)^(i)) • nBl 1 } 

n 

+ J^Efa, (/3 7l x - (1 + 5) 7 ^ (i )6 7r!(i) ) |4 J 1 • m 



3=1 
n 

= I3j2n^i^} • P[B«] - 5E[a llni{1) b ni{1) \A%] • Pf^ 1 ] 

n 

- (1 + 5) • P[^]. 

i=2 

Step 3: We can now bound the unconditional expected utility of player i when deviating to strategy b'^Vi) 
by integrating over the range of valuations for player i and using the last inequality obtained in Step 2. In 
the following we use f Vi (x) to denote the probability density function of the random variable V{. We have 



Ef^O-O^b.;^)] = / E[it i (6-(u i ),b_ i ,7)] • f Vi (x)dx 

Jo 

n poo 
,=1 J ° 



3 

fOO 



3=2 

Now, we use the property 



poo 

5 / n^il^i)b^ (l) \A^} ■ FiBi 1 ] ■ f Vi (x) dx 
Jo 

n poo 

(1 + *) £ / n^uMjM^ ■ P ^'] • dx. 
~k Jo 



E[Z\A%] ■ F[I?J] ■ f Vl (x) dx = E[Z\v{i) = j] ■ F[v(i) = j], 
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for any random variable Z as well as the fact that j w i(j\b^i(j\ < T^m^m to obtain that 

n 

> ^E[ aj r/iVi\u(i) = j] ■ F[u(i) = j] - ^^(^(^(i) = 1] • P[i/(i) = 1] 

j'=i 

n 

- (1 + <5)X) E [ a W)^0-)l l/ W = 3) ■ P K*) = j] 

i=2 

n 

> p^Efa-yiVilvit) = j] ■ ¥[u(i) = j] - SE[a lMl) b vil) \u(i) = 1] • F[u(i) = 1] 

i=i 

n 

- (i + s^H^uAuM^ = i\ ■ p K*) = j] 

i=2 

n n 

= /3^E[a jW Ki) = j] • = j] - (1 + S^n^wK^Wi^ = j] ■ = i] 

j=i i=i 

+ E[ai7 w(1) 6 w(1) |i/(0 = l]-%(i) = l] 
= ^E[a l/(i )7 i i; i ] - (1 + S)E[a u{i) j 7T ^ i)) b n{u{i)) ] + E[ai7 7r(1) 6 7r(1) |i/(i) = 1] • F[v(i) = 1]. 

By summing over all players, we obtain inequality (f5]). In particular, 

^E[u;(&^0;),b_i,7)] > J^E^^Wi] - (1 + 5) ^ E [ a i/(i)7ir(i/(0) 6 ir(«/(0)] 

+ ^EhT^mKi) = i] • pko = 1] 

i 

= /?E[OPT(v, 7 )] - (1 + S) E[aa Ai) K {i) ) + E[q i7 , (1) ^ (1) ]. 



Therefore, by Lemmas IB . 1 1 and IB . 3 1 in order to bound the price of anarchy, it suffices to find a (/?, 6)- 
bounded function such that the ratio is as low as possible. This is the puipose of the following lemma. 



Lemma B.4 Consider a function g : [0, 1] — > R+ defined as follows: 

[ j^, ye[0,A), 



g{y) 



{K - 1)[ y\ y€[A,/i), 



0. 



where n > 1 ara<f 1 > /i > A > smc/z 



1-A 



(k — l)/x + kA > 0. 77ie«, y(y) ij arc ((k — l)fi, K — 1) -bounded function. 
Proof. We begin by computing f Q g(y) dy. It holds that 



y e [a*, i], 

;ln(l - A) < 1, and (« - 1)(1 - //) In ±± 



g(y) dy 



1-1/ 



dy + 



^ (« - 1)(1 - At) 

(i-y) 2 



(k-IKm-A) 
1 - A 



Kln(l — A) < 1, 
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where the inequality holds by the first assumption concerning ft, A and /x. Hence, g satisfies the first property 
of Definition E2 

For the second property of Definition lB.2l it suffices to prove that 

(l-z)[ g{y)dy + (n-l){z- t i) > 0, to G [0,1]. 



We distinguish between three cases depending on z. First, we consider the case that z G [//, 1]. We have 
(1-z) [ g(y)dy + ( K -l){z-n) = (ft - l){z - /i) > 0, 



l 1 g(y)dy + ( K -l)(z-f i ) = (1-z) T {k / / )( \/ ) dy+ (ft - l)(z - M ) = 0. 



where the inequality holds since z G [/i, 1] and ft > 1. For z G (A, /x) we have 

(i-y) 

Finally, for z G [0, A] we have 

(1-z) J g(y)dy + (K-l)(z-/i) 

dy+ 1-z / n 

1-1/ A (i - y) 



(1-z) / ^dy+a-z)/ V ^ A „ A2 ^ dy + (K-l)(z- M ) 



, , 1 — z (1 — z)(k — l)(u — X) 
= (l-z)Kln— + K - '- + (k-1)(z-ij.) 

> (k - l)(fi - \) + (k - 1)(\ - (i) 
= 0, 

where the inequality follows by the fact that the derivative with respect to z is negative for z G [0, A]. Hence, 
it holds that g satisfies the second property of Definition IB .21 

It remains to prove that g satisfies the third property of Definition lB.21 Similarly, it suffices to prove that 

J (l-y)g(y)dy-(K-l)n + Kz > 0, to 6 [0,1]. 

Again, we distinguish between three cases depending on z. First, we consider the case that z G [/x, 1]. We 
have 

J (1 - y)g(y) dy-(n- l)fi + kz = - (ft - l)n + kz > yL > 0, 
where the first inequality follows since z G [fi, 1]. For z G [A, fi) we have 

f 1 f^ 1 ( K ~ 1)(1 — A*) 
y (1 - y)5(y)dy - (ft - 1)/U + kz = y - dy - (k - + ftz 

1 — z 

= (ft — 1)(1 — it) In (k — l)it + /tz 

> (ft- 1)(1 - u)lnl — --(ft-l)/i + ftA 

1 - A» 

> 0, 
26 



where the first inequality follows by the fact that the derivative with respect to z is strictly positive for 
z £ [A, n), and the second inequality follows by the second assumption concerning k, A and /j,. Finally, for 
z £ [0, A) we have 



f f f 1 (k - 1)(1 - fl) 
J (! - y)a(y) dy-(K-l)fi + KZ = J ndy + _ ^ 2 dy- (k- l)/i + kz 

= (k — 1)(1 — ii) In — — (k — l)/i + kX 

1 - n 

> o, 

where the inequality follows by the second assumption concerning k, A and fi. The proof of the lemma is 
complete. ■ 

We are now ready to complete the proof of Theorems 13. II and 15.61 The two conditions of Lemma IBT41 
are satisfied for k = 1.7507, A = 0.225, and fi = 0.7966. By combining Lemmas ED El and El 
we conclude that the price of (total) anarchy of GSP auction games over Bayes-Nash equilibria is at most 
■r^jy- < 2.9276. 



C Improved Bounds for Pure Nash Equilibria 

In this section we present our results for pure Nash equilibria in the full information setting (Theorems 14.11 
and 14.21 ). For simplicity of exposition, we consider all quality factors to be equal to 1; so, 7 does not appear 
in notation. Our proofs can be adapted to different quality factors in a straightforward way. We consider GSP 
auction games with n advertisers with valuations v\ > . . . > v n > and n slots with click-through-rates 
ai > . . . > a n > 0. We assume that neither all slots have the same click-through-rate nor all advertisers 
have the same valuation (in both cases, the price of anarchy is 1). 

We use the term inefficiency of allocation n to refer to the ratio OPT(v) / SW (it, v). In Lemma 1431 we 
showed that every pure Nash equilibrium corresponds to a weakly feasible allocation. Hence, the price of 
anarchy of a GSP auction game over pure Nash equilibria is upper-bounded by the worst-case inefficiency 
among weakly feasible allocations. 

Definition C.l An allocation ir is called proper if for any two slots i < j with equal click-through-rates, it 
holds ir(i) < vr(j'). 

Clearly, for any non-proper weakly feasible allocation, we can construct a proper weakly feasible one with 
equal social welfare. Hence, in order to prove our upper bounds, we essentially upper-bound the worst-case 
inefficiency over proper weakly feasible allocations. 

Given an allocation ir, consider the directed graph G(ir) that has one node for each slot, and a directed 
edge for each advertiser i that connects the node corresponding to slot i to the node corresponding to slot 
-K~ l {i). In general, G(tt) consists of a set of disjoint cycles and may contain self-loops. 

Definition C.2 An allocation ir is called reducible if its directed graph G(ir) has more than one cycles. 
Otherwise, it is called irreducible. 

Given a reducible allocation tt such that G(tt) has c > 2 cycles, we can construct c GSP auction subgames 
by considering the slots and the advertisers that correspond to the nodes and edges of each cycle. Similarly, 
for £ = 1 , . . . , c, the restriction 7/ of tt to the slots and advertisers of the ^-th subgame is an allocation for 
this game. The next fact essentially states that we can focus on irreducible allocations. 
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Fact C.3 If allocation tt is weakly feasible for the original GSP auction game, then 7r e is weakly feasible for 
the i-th subgame as well, for (. = 1, . . . , c Then, the inefficiency ofir is at most the maximum inefficiency 
among the allocations tt^ for t = 1, . . . , c. 

When considering irreducible weakly feasible allocations, we further assume that the index of the slot 
advertiser 1 occupies is smaller than the index of the advertiser that is assigned to slot 1. This is without 
loss of generality due to the following argument. Consider an irreducible weakly feasible allocation tt for 
a GSP auction game with n advertisers such that 7r _1 (l) > tt(1). We construct a new game with click- 
through-rate a[ = Vi for slot i and valuation v[ = a, for advertiser i, for i = l,...,n, and the allocation 
7T* = tt~ 1 . Observe that 7r !| r 1 (l) = 7r(l) < 7r _1 (l) = 7r*(i). Clearly, the optimal social welfare is the 
same in both games while the social welfare of tt* for the new game is SW(tt*, v') = J2i a i v 'n U) = 
Yli v i a n t (i) = a n- 1 (i) v i = SW(tt, v). We can also prove the weak feasibility conditions for tt* in the 
new game for each i, j. In order to do so, consider the weak feasibility condition for tt in the original game 
for advertisers 7r(j), 7r(i). It is djV^^ > a^v^^ — v^u)) and, equivalently, tV(i) a i > v ir(j)( a i ~ a j)- 
By the definition of the click-through-rates and the valuations in the new game and the definition of tt*, we 
obtain that a' - 1/ . y v! i > a' _ lr ..(v ' — v'f] as desired. 

We furthermore note that when v n = 0, any proper weakly feasible allocation is reducible. This is 
obviously the case if all advertisers with zero valuation use the last slots. Otherwise, consider an advertiser i 
with non-zero valuation that is assigned a slot vr _1 (z) > 7r _1 (j) where j is an advertiser with zero valuation. 
Since the allocation is proper, it holds that a^-in\ < a n -i/j\. Then, we obtain a contradiction by the weak 
feasibility condition a n -i^Vi > a w -i^(vi — Vj) for advertisers i, j. 

C.l GSP auction games with two and three advertisers 

We now complete the proof of Theorem 14.21 

We begin by presenting the matching upper bound on the price of anarchy for two advertisers and two 
slots. The upper bound follows by bounding the inefficiency of weakly feasible allocations. Consider a GSP 
auction game with two slots with click-through-rates a\ > a2 = f3oti, for j3 G [0, 1] and two advertisers 
with valuations v\ > V2 = Xv±, for A £ [0, 1]. The only non-optimal weakly feasible allocation tt assigns 
advertiser 1 to slot 2 and advertiser 2 to slot 1. Its social welfare is SW(7r, v) = 01^2+02^1 = <X\V\ (/3+A), 
while the optimal social welfare is OPT(v) = ct\V\ + «2^2 = ol\V\{\ + /3A). Furthermore, the weak 
feasibility condition for advertiser 1 implies that 02^1 > ol\(v\ — v-i), i.e., j3 > 1 — A. We have that 

OPTjy) = 1 + /3A l + (/3 + A) 2 /4 
SW(tt,v) p + X ~ /3 + A ~ 1 

where the first inequality holds since the product j3X is maximized when /3 = X = (f3 + A)/2 and the second 

inequality holds since /3 + A € [1,2] and the function 1+x x ^ is non-increasing in x £ [1,2]. 

For the case of three advertisers, we again present a tight bound on the price of anarchy. We first present 
the upper bound. Consider a GSP auction game with three slots with click-through-rates a\ > 012 > 03 > 
and three advertisers with valuations v 1 > i>2 > v 3 > and a proper weakly feasible allocation tt of slots 
to advertisers. We will prove the theorem by upper-bounding the inefficiency of tt by 1.259134. If tt is 
reducible, then the inefficiency is bounded by the inefficiency of games with two advertisers (see Fact IC .3b 
and the theorem follows by the upper bound of 5/4 proved for this case. So, in the following, we assume 
that tt is irreducible; by the observation above, this implies that V3 > 0. There are only two such allocations 
which are in fact symmetric: in the first, slots 1, 2, 3 are allocated to advertisers 3, 1, 2, respectively, and in 
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the second, slots 1, 2, 3 are allocated to advertisers 2, 3, 1, respectively. Without loss of generality (see the 
discussion above), we assume that ir is the former allocation. 

Let /3, 5, A, and ii be such that Q2 = f3ct\, = 5a±, V2 = Xv%, and V3 = llv\. Clearly, it holds that 1 > 
(3 > 5 > and 1 > A > ll > 0. The social welfare of allocation n is SW(ir, v) = aiVi(fi+/3+5\) whereas 
the optimal social welfare is OPT(v) = ait>i(l + /3A + <5/i). Furthermore, since ir is weakly feasible, the 
weak feasibility conditions for advertisers 1 and 3 and advertisers 2 and 3 are a2V\ > — ^3) and 

(X3V2 > o.\{v2 — V3), respectively, i.e., (3 > 1 — ll and 5 > 1 — We are now ready to bound the 
inefficiency of it. Let e, 9 > be such that /3 = 1 — + e and 5=1 — ^ + 9. We have 

OPT(v) 1 + pX + dn _ 1 + X-llX + ll-^ + €X + 9li 

SW(tt,v) ~ ll + (3 + 5X ~ 1 + X- fi + e + 9X 

< i + A-/xA + /i-^ 
1 + A-/X 



,- 



The inequality follows since 1 > A > /i > implies that 1 + A — l\lX + ll — ^- = 1 + A — ll + — A) + 
- ll/X) > 1 + A - Li > 1 and e + <9A > eA + 9li > 0. 

For /z G [0, 1], this last expression is maximized for the value of ll that makes its derivative with respect 
to ll equal to zero, i.e., ll = — a/A 3 + 1 + A + 1. By substituting \x, we obtain that 

OPT(v) £ A» + A + 2-2^TT £ j + 2( . = L2M134 



SW(n,v) ~ X 
where ( = 0.129567 and the second inequality follows by the following lemma (Lemma [C .41 ) . 

Lemma C.4 Let ( = 0.129567. For any X G [0, 1], it holds that VX 3 + 1 > 1 - (A + 4p 

Proof. Since both parts of the inequality are non-negative for AG [0, 1], it suffices to show that the function 

/(A) = (A 3 + 1) - (l - CA + is non-negative for A G [0, 1]. Let g(X) = + (1 + C)A 2 - (1 + 



C 2 )A + 2C and observe that /(A) = A -5(A). The proof will follow by proving that g(X) > when A G [0, 1]. 
Observe that the derivative of g is strictly negative for A = and strictly positive for A = 1. Hence, the 

minimum of g in [0, 1] is achieved at the point A* = 4+4 £ 2 V^ + 8 C+i w jj ere ^ derivative of g becomes 
zero. Straightforward calculations yield that g(X*) > and the lemma follows. ■ 

In the following we prove that the above analysis is tight. Consider a GSP auction game with three 
advertisers with valuations v± = I, v% = 0.5296, and v% = 0.14583, respectively, and three slots with 
click-through-rates ot\ = 1, «2 = 0.55071, and 03 = 0.4704, respectively. Let b = (61,62,^3) be a bid 
vector with b\ = 0, 62 = V2 = 0.5296, and 63 = ^3 = 0.14583, respectively. So, advertiser 2 is allocated 
slot 1, advertiser 3 is allocated slot 2, and advertiser 1 is allocated slot 3. We refer to this allocation as ir. It 
is not hard to verify that b is a pure Nash equilibrium, and that the price of anarchy is given by: 

OPTjy) = a x v\ + a 2 t>2 + «3^3 > 1 259133 
SW(ir,v) 01^2 + 02^3 + 0=3^1 ~~ 

The proof of Theorem l4.2l is complete. 
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C.2 GSP auction games with many advertisers 

We now prove Theorem 14- 1 1 In order to do so, we will actually prove the stronger claim that the worst- 
case inefficiency among weakly feasible allocations of any GSP auction game is at most r = 61+ 1 7 2 '^^ ~ 
1.28216. We use induction. As the base of our induction, we use the fact that GSP auction games with one, 
two, or three advertisers have worst-case inefficiency among weakly feasible allocations at most 1.28216. 
For a single advertiser, the claim is trivial. For two or three advertisers, it follows by the proof of Theorem 
14.21 Let n > 4 be an integer. Using the inductive hypothesis that the worst-case inefficiency among weakly 
feasible allocations of any GSP auction game with at most n — 1 advertisers is at most r, we will show that 
this is also the case for any GSP auction game with n advertisers. 

Consider a GSP auction game with n advertisers with valuations v\ > v 2 > . . . > v n > and n slots 
with click-through-rates a± > 02 > . . . > a n > and let it be a proper weakly feasible allocation. If n is 
reducible, the claim follows by Fact IC. 31 and the inductive hypothesis. So, in the following, we assume that 
7r is irreducible; this implies that v n > 0. Let j be the advertiser that is assigned slot 1 and i\ be the slot 
assigned to advertiser 1. Without loss of generality, we assume that i\ < j since the other case is symmetric; 
see the discussion at the beginning of Section O Also, let i 2 be the slot assigned to advertiser i\. By our 
assumptions, the integers j, 1, i\, and i 2 are different. 

We will show that 

SW(ir,v) > a lVj + afaOi - ^) + a i2 {v h - Vj ) - — + 0PT ^ , (9) 

Once we have proved inequality (O, we can obtain the desired relation between SW(-7r, v) and OPT(v) 
using the following technical lemma. 

Lemma C.5 Let r = 61 + 7 2 ^ Tf w 1.28216 and f(fi, 5, A, /i) = p + p (l - + <5(A - /i) - i. Then, the 
objective value of the mathematical program 

minimize f(f3, 5, A, fx) 
subject to (3 > 1 — fj, 

5 > 1 - fi/X 
1 > A > n > 
1 > f3,5 > 

is non-negative. 

Proof. Since \i < A < 1, we have that f(/3,5, A, /i) is non-decreasing in ft and 5. Using the first two 
constraints, we have that the objective value of the mathematical program is at least 

\ As r r \ r J A 

which is minimized f or fi = A — 4- to 



A 2 A . . A 2 \ 1 A A 2 A 3 



/ l-A + -,-,A,A-- = 1 + 

2r 2r 2r r r r 4r z 
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In order to complete the proof it suffices to show that the function g(X) = 1 — £ — 7 + 7 — jpi is non- 
negative for A 6 [0, 1]. Observe that g(X) is a polynomial of degree 3 and, hence, it has at most one local 
minimum. Also observe that the derivative of g(X) is — £ + ^ — which is strictly negative for A = and 
strictly positive for A = 1. Hence, its minimum in [0, 1] is achieved at the point A* = 4r - 2 v / 4r 2 3 3r w here 
the derivative becomes zero. Straightforward calculations yield that g(X*) = and the lemma follows. ■ 

So, assuming that (O holds, we can apply Lemma |C31 with /3 = a^/ax, 5 = a>i 2 /a>i, X = v^/vi, and 
fj, = Vj/vi. Clearly, the last two constraints of the mathematical program in Lemma |C31 are satisfied. Also, 
observe that the weak feasibility conditions for advertisers 1 and j and advertisers i\ and j in allocation it 
are Ui x V\ > a\{v\ — Vj) and a^v^ > cti(vi 1 — Vj), respectively, i.e., f3 > 1 — \i and 6 > 1 — fx/X and the 
first two constraints of the mathematical program in Lemma IC5l are satisfied as well. Now, using inequality 
© and Lemma |C31 we have that 

\ ot\ ot\ V\ V\ J r r 

and the proof follows. 

It remains to prove inequality d9]). We distinguish between three cases depending on the relative order 
of j,ii, and i^, in each of these cases, we further distinguish between two subcases. In each case, we 
exploit the structure of allocation it to reason as follows. We consider a restriction of the original game 
(i.e., a different "restricted" game) by removing some advertisers from the original game and the slots they 
occupy in n. The particular advertisers to be removed are different in each case. We denote by tt* the 
restriction of allocation ir to the advertisers and slots of the restricted game. We also use v' to denote the 
valuation profile in the restricted game; so, SW(ir', v') denotes the social welfare of ir' in the restricted 
game. An important observation is that ir' is a weakly feasible allocation in the restricted game since the 
weak feasibility conditions for it' are just a subset of the corresponding conditions for it (for the original 
game). Furthermore, the restricted game has at most n — 1 advertisers and, by the inductive hypothesis, we 
know that the inefficiency of ir' is at most r. Then, inequality (O follows using this fact and by carefully 
expressing the optimal social welfare in the new game. 

Case 1.1: 1 < i\ < j < ii and otj < ai 2 r. Consider the restriction of the original game that consists of 
the advertisers different than j, 1, and i\ and the slots different than 1, i\, and %2- Let it' be the restriction of 
it to the advertisers and slots of the new game and let v' be the restriction of v to all advertisers besides j, 1 
and i\. Clearly, it' is weakly feasible for the new game since the weak feasibility conditions for it' are just a 
subset of the corresponding conditions for it (for the original game). Also, note that the efficient allocation 
for the restricted game assigns advertiser k to slot kfork = 2, . . . , i± — 1, i\ + 1, . . . , j — 1, ii + 1, . . . , n and 
advertiser k + 1 to slot k for k = j, . . . , %2 — 1. By the inductive hypothesis, we know that the inefficiency 
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of 7r' is at most r. Hence, we can bound the social welfare of n as 

SW(tt,v) = a 1 v j +a h v 1 + ai 2 v il + ^ ot k v n ^ 

= aiVj + a.i x V\ + a i2 v ix + SW(7r', v') 

/ ii— 1 j'-l i2-l n 

> aiUj + ai^i + a i2 v h + - a k v k + ^ a fc t> fc + ^ afcffc+i + ^ 



fc=2 fc=ii+l fc=j fc=i2+l 

il— 1 J— 1 «2 n 

"A: 

fc=2 fc=ii+l k=j+l k=i 2 + l 

aiVj + a^ui + a^t^ + - I } j a k v k - aivi - a^v^ - ajVj I 



^ / il— 1 J-l %2 n 

> aiVj + a h vi + ai 2 v h + - I ^2 a k v k + ^ a k v k + ^ c^u*. + ^ 



> a 1 v j + a il (v 1 ) + a^lv^ - Vj) h 

J qp if* rp 

and inequality (O follows. The first inequality follows by the inductive hypothesis and the definition of 
the efficient allocation for the restricted game. The second inequality follows since a k > a k+ i for k = 
j, . . . ,12 — I. The last inequality follows since ay < ai 2 r. 

Case 1.2: 1 < i\ < j < 12 and aj > ai 2 r. We use the restriction of the original game that consists of the 
advertisers different than j and 1 and the slots different than 1 and i ± . Now, the efficient allocation for the 
restricted game assigns advertiser k to slot k for k = 2, . . . , i\ — 1, j + 1, . . . , n and advertiser k — 1 to slot 
k for k = it + 1, . . . , j. Using the inductive hypothesis for the restriction n' of it to the restricted game, we 
can bound the social welfare of n as 

SW(ir,v) = aiv j + a il vi+ ^ a k v n ^ 

= aiVj + a ix vi + SW(ir', v') 

^ Ai-l j n 

> aiVj + a^vi + - afcUfc + ^ Ofc^fc-i + ^ «fc 

yfc=2 fc=u+i fc=j+i 

1 / n j 

= aiVj + a h vi + - \^^a k v k - aivi - a^v^ + a k (v k -i-v k ) 

\k=l k=ii+l 



1 / " J 



,fc=l fc=ii + l 



1 OPT(v) 
= aif j + a^ui - - [aivi + a^f^ + ajVj - ajV ix ) H 

■ / "in, , \ . OPTiy) 

> aiUj +Q!i 1 (Ul ) + Oi 2 {v ix - Vj) 1 

and inequality © follows. The first inequality follows by the inductive hypothesis and the definition of the 
efficient allocation for the restricted game. The second inequality follows since a k > aj and v k -i — v k >0 
for A; = ii + 1, . . . , j. The last inequality follows since aj > ai 2 r. 
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Case II.1: 1 < i\ < %i < j and Vi 2 < vjr. We use the restriction of the original game that consists of the 
advertisers different than j, 1, and i\ and the slots different than 1, i\, and i^. Now, the efficient allocation 
for the restricted game assigns advertiser k to slot k for k = 2, . . . , ii — 1, i\ + 1, . . . , i% — l,j + 1, . . . , n 
and advertiser k — 1 to slot k for k = %2 + 1, . . . ,j. Using the inductive hypothesis for the restriction ir' of 
7r to the restricted game, we can bound the social welfare of tt as 

SW(tt,v) = aiVj + a h vi + a i2 v h + ^ a k v n ^ 

= aiVj + Oii x v\ + a i2 v ix + SW(tt', v') 

^ / ii— 1 *2— 1 j n 

> ait>j + qf^i + o^t^ + - ^ Qifc^fc + ^ afc"Vfc+ ^ afc«fc-i + X] 

\fc=2 fe=ii+l fc=i 2 +l k=j+l 

j / ii— 1 i2-l j n 

> ai«j + a^vi + cKigUij + - a k y k + ^ a k y k+ ^2 akVk+ ^2 akVk 

k=2 fe=ii+l fc=« 2 +l fc=i+l 



ai«j + a h vi + aia^! + i 1^2 akVk ~ aiVl ~ Qi i^i ~~ a i2 v i2 J 

Vi, . ai^i OPT(v) 
> aiUj + aj^ui-— j + ajjj^ii 1 

and inequality (O follows. The first inequality follows by the inductive hypothesis and the definition of 
the efficient allocation for the restricted game. The second inequality follows since v^-i > Vf. for k = 
i2 + 1, • • • ,j. The last inequality follows since Vi 2 < Vjr. 

Case II.2: 1 < i\ < %2 < j and Vi 2 > vjr. We use the restriction of the original game that consists of 
the advertisers different than 1 and i\ and the slots different than i\ and 12. Now, the efficient allocation for 
the restricted game assigns advertiser k to slot k for k = 12 + 1, • • • , n, advertiser i\ + 1 to slot i\ — 1, and 
advertiser k + 1 to slot k for k = 1, . . . , i\ — 2, + 1, ■ ■ ■ , «2 — 1. Using the inductive hypothesis for the 
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restriction it' of ir to the advertisers and slots of the restricted game, we can bound the social welfare of tt as 



SW(n,v) = a^vx + a i2 v h + ^ 

= a h vi + a i2 v^ + SW(tt', v') 

^ / ii-2 12 — 1 n 

> a h vi + ai 2 v h + - I a k Vk+i + a h -iv il+ i + ^ afcffc+i + ^ 

\fc=l fc=n+l fc=j 2 +l 

Y ( n i\-2 

= a h V! + a i2 v h + - I ^ afc^fc + ^ ( afc ~ "fc+iH+l + - ai 1+ iM 1+ i 



r 

i 2 -i 



fc=il+l 

1 / n ii-2 

> aj^i + a^w^ + - I ^ "fc^fc + ^ (afe - a fc+ i)u i2 + (a^-i - a^+i)^ 



\fc=i fc=i 

12 — 1 

+ ^ - a k+1 )v i2 - aivi - a^v^ 

1 . . OPT(v) 

= a^vi + [aivi + a^t^ + ai 2 v i2 - a\Vi 2 ) -\ 

r r 

Uj, ai^i OPT(v) 
> ai«j + ) + a i2 {v il -Vj) 1 

and inequality © follows. The first inequality follows by the inductive hypothesis and the definition of 
the efficient allocation for the restricted game. The second inequality follows since a k — a k +i > and 
Vk+i > vi 2 for k = 1, . . . , i\ — 2, i\ + 1, . . . , 12 — 1 and a^-i — oti 1+ \ > and i^+i > vi 2 . The last 
inequality follows since vi 2 > vjr and a± > ai 2 . 

Case III.1: 1 < 12 < i\ < j and Vi 2 < vjr. We use the restriction of the original game that consists of the 
advertisers different than j, i\, and 1 and the slots different than 1, 12, and i\. Now, the efficient allocation 
for the restricted game assigns advertiser k to slot k for k = 2, . . . , — 1, j + 1, . . . , n advertiser i\ — 1 
to slot i\ + 1, and advertiser k — 1 to slot k for k = i% + 1, • • • , i\ — 1, %\ + 2, . . . ,j. Using the inductive 
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hypothesis for the restriction tt' of tt to the restricted game, we can bound the social welfare of tt as 

SW(tt,v) = aiVj + a i2 v h + a h vi + ^ a k v^ 

= aiVj + OjjjUij + a^vi + SW(tt', v') 

, / i2— 1 ii— 1 

> a^j + a i2 v h + a h vi + - ^ ^2 a k v k + ^ afc^-i + a h+1 v il ^ 1 



k=2 k=i 2 +l 



3 n 

+ ^2 a k V k~l + ^2 ®kVk 
k=h+2 k=j+l 



y/c=2 fc=« 2 +i fc=«i+i 

1 

= ai^j + a^^jj + a h vi + - I 2^ cn k v k - aivi - a i2 v i2 - a^v^ 

r \k=l 

> ai-Uj + ctjj (yx ) + a i2 {v il -Vj) 1 



and inequality © follows. The first inequality follows by the inductive hypothesis and the definition of 
the efficient allocation for the restricted game. The second inequality follows since v k _i > v k for k = 
i2 + 1, • • • , i\ — 1, h + 2, . . . , j and v^-i > i^+i. The last inequality follows since V{ 2 < Vjr. 

Case III.2: 1 < %2 < i\ < j and v- l2 > Vjr. We use the restriction of the original game that consists of the 
advertisers different than i\ and 1 and the slots different than %2 and i\. Now, the efficient allocation for the 
restricted game assigns advertiser k to slot k for k = 12 + 1, . . . , i\ — 1, i\ + 1, • • • , n and advertiser k + 1 
to slot k for k = 1, . . . , 12 — 1. Using the inductive hypothesis for the restriction tt' of tt to the restricted 
game, we can bound the social welfare of tt as 

SW(tt,v) = a^v^ + a h vi + a k v w ^ 

kg{i2,h} 

= a i2 v h + a h vi + SW(tt',v') 

^ / h — 1 ii— 1 n 

> a i2 v h + a h vi + - j a kVk+i + ^2 a k v k+ yZ 



OikVk 

k=l k=i 2 +l fc=ii+l 

n 12 — 1 



Oia^ii + &hVl + - "fc^fc - ai^l - OLi x Vi Y + ^2 ( a k ~ ®k+l)vk+l 

\k=l k=l , 

Y ( n * 2_1 



\fc=i fc=i 

1 . . OPT(v) 
= Oijfij + a^vi - - (aivi + OLi x Vi x + aj 2 -yj 2 - ai-y i2 ) H 

ai^i OPT(v) 

> ai^j + a^ivi — ) + aii^Vi-L — Vj) 1 
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and inequality (O follows. The first inequality follows by the inductive hypothesis and the definition of 
the efficient allocation for the restricted game. The second inequality follows since a.\. — ctk+\ > and 
Vk+i > Vi 2 for k = 1, . . . , %2 — 1. The last inequality follows since Vi 2 > Vjr and a\ > ai 2 . 
The proof of Theorem 14. H is complete. 

D Improved Bounds for Learning Outcomes 

In this section, we focus on the full information game. For simplicity we assume that all quality factors 
7i = 1, and assume that players are sorted so that v\ > V2 > • • • > v n (all proofs extend to the case with 
general quality factors by considering effective values 7^ in place of valuations everywhere). 

The main goal of this Appendix is to prove Theorem 15.21 Similarly to the proof of Theorem 13.11 in 
Appendix |B1 for Bayes-Nash equilibria, the proof considers a player i with valuation Vi, possible bids of the 
form yvi, and uses the fact that the player has no-regret about such alternative bids. In the full information 
case, we can handle the player with top valuation separately, and will only use that this player 1 has no regret 
about bidding her actual valuation v\. For any other player i, the proof is analogous to the proof of Theorem 
13. 1 1 in Appendix IBl However, we no longer have to consider separately the case when the player's optimal 
slot is 1. This allows us to drop one requirement for the function g in the definition IB .21 We further simplify 
that definition by setting 5 = (3 (we have verified that different values for 5 do not yield any improvement). 
More formally, we will need the following definition. 

Definition D.l Let (3 e (0, 1]. A function g : [0, 1] — > M+ is called (3-bounded if the following two 
properties hold: 

[ s(y)dy<i, 

J o 

it) / (1 - y)g(y) dy > (3 - (1 + /3)z, tee [0,1]. 

J z 

The following lemma states the connection of the price of anarchy to the existence of /3-bounded func- 
tions. 

Lemma D.2 Let (3 G (0, 1] be such that a (3-bounded function exists. Then, the price of total anarchy of the 
Generalized Second Price auction in the full information setting is at most 1 + 1/(3. 

Proof. In the proof, we consider a GSP auction game with n slots with click-through-rates a\ > «2 > 
• • • > ct n > and n conservative players with valuations vi,V2, ■ ■ ■ ,v n > 0. Let b denote the bids of the 
players at a coarse correlated equilibrium. 

We begin by lower-bounding the expected utility of each player at a coarse correlated equilibrium. We 
first consider player 1 and her deviation to the bid v\ . Then, player 1 would always be allocated slot 1 and 
would pay the highest bid among the remaining players (which is at most fr^m) per click. By the definition 
of the coarse correlated equilibrium such a deviation does not increase her expected utility (as the player has 
no regret), i.e., 

EMb)] > EtmCi^b-j)] > E[«^i -6 W(1) )] > /Saiui - /3E[ ai ^ (1) ], (10) 

where the last inequality follows since v\ > b^m and since (3 £ (0, 1]. Now, consider the deviation of 
player i to the deterministic bid b\ < Vi. Then, she would be assigned to slot i or higher and would get 
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utility at least ai(v{ — b^) when the z-th highest bid is smaller than b^. Again, by the definition of the coarse 
correlated equilibrium such a deviation does not increase her expected utility, i.e., 

E[«i(b)] > EM&^b^)] > E[ai(vi - fyl{K(i) < &*}]• 
Using the first property in Definition ID. II for g as well as the last inequality (with b\ = yvi) , we have 



E[«i(b)] > f g(y)-E[ Ui (b)]dy 
Jo 



> 







g(y) • E[ai(vi - yvi)t{b n{i) < yv^} dy 



E[aiVi / (1 - y)g(y)l{b n{i) < yvi}dy] 



''[am / 

JK(i)/vi 



E[aiVi I (1 - y)g(y)dy]. 

r(i)/ 

We now apply the second property of Definition ID. 11 for function g to obtain 

E[ Ui (b)] > E[p ai Vi - (1 + P)aib n(i) ] = pa iVi - (1 + P)E[a i b 7r{{) }. (11) 
By summing over all players and using inequalities ( fTOl ) and (fTTT >. we have 
^EMb)] = E[ Ul Cb)] + ^E[«i(b)] 

i i>2 

> pYI aiVi -( 1+ p)Y1 E M-«] + 

i i 

= f30PT(v) - (1 + p) ^E^^)] + E[ ai b n{1) ]. 

i 

Now, we use this last inequality in the same way we used inequality © in the proof of Lemma |B~T1 By 
the fact that the social welfare is the sum of the expected utilities of the players plus the total payments, we 
obtain 

E[SW(7v(b),v)\ = E^UiCbJl+E^Oib^i+i)] 

i i 

> /30PT(v) - (1 + 0) J2 HaiKii)] + E[a 1 6 7r(1) ] + ]T E[ ai b n{l) ] 

i i>2 

= POPT(v) - p^Efab*®] 

i 

> pOPT{w) - pE[SW(ir(b),v)], 

which implies that the price of total anarchy OPT(v) /E[5W(7r(b), v)] is at most 1 + 1//3, as desired. ■ 

We are ready to complete the proof of Theorem 15.21 By Lemma lD72l it suffices to find a /3-bounded 
function with (3 as high as possible. Let A « 0.4328 be the solution of the equation 1 — A + In (1 — A) = 
and g : [0, 1] — > be the function defined as follows: 

9[V) I 0, ye(\,l] 
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We will show that g is /3-bounded for /3 = jzr\', the upper bound of 1/A « 2.3102 stated in Theorem 15.21 
will then follow. 

Indeed, by the definition of A, we have 



i.e., g satisfies the second property of Definition ID. li as well. 

E Irrational and Partially Rational Players 

In this section we consider the effect of partial rationality on the welfare generated by the GSP auction. We 
first consider a setting in which the players are not necessarily perfect utility optimizers, but rather can only 
be assumed to apply strategies that form an approximate equilibrium. We then study a setting in which some 
fraction of the players bid arbitrarily, without any rationality assumptions beyond avoiding the dominated 
strategy of overbidding (see Section I2.3I ). In both cases, we find that the social welfare guarantees of the 
GSP auction degrade gracefully with the degree of irrationality present in the players. 

E.l Approximate equilibria 

We will consider the social welfare generated by the GSP auction with uncertainty when players play only 
approximately utility-maximizing strategies. In Section [3l we assumed that rational players apply strategies 
at equilibrium. However, due to limits on rationality or indifference between small differences in utility, it 
may be the case that players converge only to an approximate equilibrium. We begin by defining this notion 
formally. Given a joint distribution (F, G) over types and quality factors, we say that strategy profile b is an 
e-Bayes-Nash equilibrium for distributions F, G if, for all players i, all types Vi, and all alternative strategies 



^v-iv,h[ui{h(vi),h-i(\-i),-i)\vi] > (1 - e)E v _ ii7i b[iii(6' i K) 5 b_i(v_i),7)|^]. 

Notice our choice of the multiplicative definition of approximate equilibria, justified by the fact that we have 
chosen not to scale values to lie in [0,1]. 

We define the e-Bayes-Nash Price of Anarchy to be 



where e-BNE is the set of all e-Bayes-Nash equilibria. 

We now claim that our bound for social welfare at (non-approximate) equilibrium degrades gracefully 
as we relax the degree to which a bidding strategy only approximates an equilibrium. 



Theorem E.l The e-Bayes-Nash price of anarchy of the Generalized Second Price auction is at most 
1.2553 + (1 - e)" 1 • 1.6722. 




Hence, g satisfies the first property of Definition lD.il We also observe that 




E v , 7 [OPr(v, 7 )] 



E Vi7ib(v) [5W(7r(b(v), 7 ),v, 7 )] 
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The intuition behind Theorem lE.ll is that the bound for exact equilibria obtained in Theorem B. 1 I depends 
on the Bayes-Nash equilibrium condition in a continuous way. This continuity is captured by the semi- 
smoothness of the GSP auction (as well as by inequality §5$, used to prove Theorem 13. II in Appendix IB1. 
Indeed, the following Lemma follows by a trivial modification to the proof of Lemma IbTTI 

Lemma E.2 Assume that for every GSP auction game there is a bidding profile h' and parameters (3,8 > 
such that inequality (f5]) holds for any strategy profile b. Then the e-Bayes-Nash price of anarchy of the 
Generalized Second Price auction is at most ^~ £ o +S . 

It then follows immediately from Lemmas |B .31 and |B~4l (see Appendix IB1 that the e-Bayes-Nash price of 
anarchy of the GSP auction is at most ^-^07^7^^ ~ 1.2553 + (1 — e) _1 • 1.6722. 



E.2 Irrational players 

We now consider a setting in which, of the n advertisers who bid in the GSP auction, some subset of them 
are "irrational" and cannot be assumed to apply strategies at equilibrium. We still think of the irrational 
advertisers as being true players in the GSP auction, with valuations and quality scores. The irrational 
advertisers simply may not apply rational bidding strategies; for example, they may not have experience 
with the GSP auction, or not know about historical bidding patterns. 

Our setting will be an extension of the GSP auction with uncertainty. We will first provide some defini- 
tions. Given valuations v, quality scores 7, an outcome n, and a set S of players, the social welfare restricted 
to set S is SWs(tt, v, 7) = J2ies a -K- 1 {i)li v i> ^e total value of the outcome it for the advertisers in S. The 
optimal social welfare restricted to S is OPTs(v, 7) = max,,- SWs(tt, v, 7). 

Given a joint distribution (F, G) over types and quality factors, and a set S of players, we say that 
strategy profile b is an 5-Bayes-Nash equilibrium for distributions F, G if, for all players i 6 5, all types 
Vi, and all alternative strategies 6/, 

E v _, l) ~f ) b[ui(bi(v i ) 1 b^i(v_i),>y)\vi} > E v _ ii7|b [ttj(£4(?;j), b_;(v_j), j)\vi\. 

That is, no player in S can improve her utility by modifying her bid, but no such restriction is imposed upon 
the players outside S. 

We will show that, for each set S of players, the total expected social welfare obtained by GSP at an 
5-Bayes-Nash equilibrium is a good approximation to E[OPTs(v)]. We can interpret this result as stating 
that the addition of irrational players does not significantly degrade the social welfare that would have been 
generated had they not participated. Note that we cannot hope to always obtain a good approximation to 
E[OPT(v)] (the optimal social welfare of all advertisers) at all S'-Bayes-Nash equilibria; for example, it 
may be that the valuations of the players outside S are very large, but they choose (irrationally) to bid 0. 

We note that our no-overbidding assumption (Section 12.31 ) continues to apply to all players, not only 
to the players in S. In other words, we require that bi(vi) < V{ for all i $ S and all V{. We feel this is a 
natural restriction to impose even on "irrational" advertisers, as overbidding is an easily-avoided dominated 
strategy. Moreover, it is arguable that inexperienced advertisers would bid conservatively, and not risk a 
large payment with no gainO 

6 This relies on the simplifying assumption that all advertisers have knowledge of their own private valuations. Admittedly, this 
requires a certain level of sophistication and may be difficult to attain in practice. Our argument is thus limited to imperfect strategy 
choice given perfect knowledge of types. It remains open to extend this analysis to players who may misunderstand their own 
valuations. 
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Formally, given a non-empty subset S of advertisers, we define the S-Bayes-Nash Price of Anarchy to 

be 

E v , 7 [OPr s (v, 7 )] 

sup 

V,G,h(-)eS-BNE E V!7!b(v )[SW(7r(b(v),7),v,7)] 

where S-BNE is the set of all 5-Bayes-Nash equilibria. 

Our main result is the following extension of Theorem l3.ll 

Theorem E.3 For any non-empty subset S of rational advertisers, the S-Bayes-Nash price of anarchy of 
the Generalized Second Price auction is at most 2.927. 

In order to prove this theorem, we need an inequality similar to inequality (f5]) in Section |B] In particular, for 
every bid profile b, there exists a bid profile b' defined on the rational advertisers such that 



£E[ui(&J(i/i),b_i,7)] > /3E[OPT 5 (v,7)]-(l + 5)^E[a, W 7 7r(Ki)) ^ (l/(i)) ]+E[a 1 7 7r(1) 6 7r(1) ]. (12) 
ieS ies 

We can prove inequality ([121 by following the same steps as in the proof of Lemma llOl and by consider- 
ing the utilities of the rational players at their most profitable deviation. Here, v(i) should be interpreted as 
the slot the rational advertiser i occupies in the efficient allocation restricted to 5 and ir(j) is the advertiser 
that occupies the j-th slot in allocation tt (this advertiser can be rational or irrational). Similarly, Tr l (j) is the 
player with the j'-fh highest effective bid among all advertisers besides the rational advertiser i. 

All the arguments hold in this case as well. However, there is a minor point that should be justified. 
Observe that in order to obtain inequalities © and dU), we used the fact that the j-th highest effective 
bid (excluding advertiser i) is not larger that the effective value of advertiser i when v(i) = j, i.e., when 
slot j is allocated to advertiser i in the efficient allocation restricted to 5. When adapting the proof to the 
case of rational and irrational players, it may be the case that v(i) = j when the rational advertiser i has 
valuation Vi = x but 7 7r i( 3 -)& 7r »(j) > %x. This may be due to the fact that player 7r l (j) is one of the irrational 
players. Fortunately, both inequalities ((TJ) and © are obviously true in this case as well. Observe that {3 < 5 
(otherwise, the second property of Definition IB . 21 would not hold for z = 1) and the right-hand side of both 
inequalities is non-positive. The changes in the rest of the proof of Lemma 1531 are minor. 

Then, Theorem IE.3I follows by the next lemma that exploits inequality (fT2)) and using the same values 
for j3 and 5 that we used in Section iBl 

Lemma E.4 Assume that for every GSP auction game with a non-empty set S of rational players there is 
a bidding profile h' for the players in S and parameters {3,5 > such that inequality A12\) holds for any 
strategy profile h. Then, the S-Bayes-Nash price of anarchy of the Generalized Second Price auction is at 
most 

Proof. Consider an 5-Bayes-Nash equilibrium b. Define b' as in inequality (fT2~l) and observe that E[itj(b, 7)] > 
E[itj(&^(«j), b_i, 7)] for every player i G S. We use this inequality and the fact that the social welfare is at 
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least the sum of the expected utilities of the rational advertisers plus the total payments to get 

E[SW(7r(b(v), 7 ),v, 7 )] > £EMb, 7 )] + E E ^(i+i)^(i+i)l 

i£S i 

> ^E[ni(6-(^),b_i,7)] +^E[a!i7 7r ( i )6 7r ( i )] 

> /3E[OPT 5 (v, 7)] - (1 + S) ^2 ^u{i)lir{y{i))K{u{i))\ + E[oi7 ff(1 )6 w(1 )] + E[ai7 ffW 6 ffW ] 

> /3E[OPT s (v, 7 )] - (1 + <J)^E[a i 7 7rW 6 7rW ] + ^E[a i7 ^A W ] 

j « 

= /3E[OPT s (v, 7 )] - 5j2n<XiMi) b *(i)} 

i 

> /3E[OPT s (v,7)] - <5E[5T7(7r(b(v),7),v,7)], 

which implies that the S*-Bayes-Nash price of anarchy is at most as desired. ■ 
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